Optimizing redeployment of functions and services across multiple container platforms and installations

ABSTRACT

A method of distributing microservice containers for a service across a plurality of computing environments may include receiving a service that is built from a plurality of containerized microservices to be deployed in a container platform. The container platform may include a plurality of computing environments. The method may also include receiving a deployment criteria for deploying the service in the container platform; accessing characteristics of the plurality of computing environments; and deploying the plurality of containerized micro services across the plurality of computing environments based on the deployment criteria and the characteristics of the plurality of computing environments.

CROSS-REFERENCES TO RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Application No. 62/566,351 filed on Sep. 30, 2017, which is incorporated herein by reference. This application is also related to the following commonly assigned applications filed on the same day as this application, each of which is also incorporated herein by reference:

-   -   U.S. patent application Ser. No. ______ filed on September         ______, 2018, titled API REGISTRY IN A CONTAINER PLATFORM FOR         AUTOMATICALLY GENERATING CLIENT CODE LIBRARIES (Attorney Docket         No. 088325-1090745);     -   U.S. patent application Ser. No. ______ filed on September         ______, 2018, titled API REGISTRY IN A CONTAINER PLATFORM         PROVIDING PROPERTY-BASED API FUNCTIONALITY (Attorney Docket No.         088325-1090746);     -   U.S. patent application Ser. No. ______filed on September         ______, 2018, titled DYNAMIC NODE REBALANCING BETWEEN CONTAINER         PLATFORMS (Attorney Docket No. 088325-1090747);     -   U.S. patent application Ser. No. ______filed on September         ______, 2018, titled REAL-TIME DEBUGGING INSTANCES IN A DEPLOYED         CONTAINER PLATFORM (Attorney Docket No. 088325-1090753);

BACKGROUND

In the abstract, containers in any form represent a standardized method of packaging and interacting with information. Containers can be isolated from each other and used in parallel without any risk of cross-contamination. In the modern software world, the term “container” has gained a specific meaning. A software container, such as a Docker® container, is a software construct the logically encapsulates and defines a piece of software. The most common type of software to be encapsulated in the container is an application, service, or microservice. Modern containers also include all of the software support required for the application/service to operate, such as an operating system, libraries, storage volumes, configuration files, application binaries, and other parts of a technology stack that would be found in a typical computing environment. This container environment can then be used to create multiple containers that each run their own services in any environment. Containers can be deployed in a production data center, an on-premises data center, a cloud computing platform, and so forth without any changes. Spinning up a container on the cloud is the same as spinning up a container on a local workstation.

Modern service-oriented architectures and cloud computing platforms break up large tasks into many small, specific tasks. Containers can be instantiated to focus on individual specific tasks, and multiple containers can then work in concert to implement sophisticated applications. This may be referred to as a microservice architecture, and each container can use different versions of programming languages and libraries that can be upgraded independently. The isolated nature of the processing within containers allows them to be upgraded and replaced with little effort or risk compared to changes that will be made to a larger, more monolithic architectures. Container platforms are much more efficient than traditional virtual machines in running this microservice architecture, although virtual machines can be used to run a container platform.

BRIEF SUMMARY

In some embodiments, a method of distributing microservice containers for a service across a plurality of computing environments may include receiving a service that is built from a plurality of containerized microservices to be deployed in a container platform. The container platform may include a plurality of computing environments. The method may also include receiving a deployment criteria for deploying the service in the container platform; accessing characteristics of the plurality of computing environments; and deploying the plurality of containerized micro services across the plurality of computing environments based on the deployment criteria and the characteristics of the plurality of computing environments.

In some embodiments, a non-transitory, computer-readable medium may include instructions that, when executed by one or more processors, causes the one or more processors to perform operations including receiving a service that is built from a plurality of containerized microservices to be deployed in a container platform. The container platform may include a plurality of computing environments. The operations may also include receiving a deployment criteria for deploying the service in the container platform; accessing characteristics of the plurality of computing environments; and deploying the plurality of containerized micro services across the plurality of computing environments based on the deployment criteria and the characteristics of the plurality of computing environments.

In some embodiments, a system may include one or more processors and one or more memory devices comprising instructions that, when executed by the one or more processors, cause the one or more processors to perform operations including receiving a service that is built from a plurality of containerized microservices to be deployed in a container platform. The container platform may include a plurality of computing environments. The operations may also include receiving a deployment criteria for deploying the service in the container platform; accessing characteristics of the plurality of computing environments; and deploying the plurality of containerized micro services across the plurality of computing environments based on the deployment criteria and the characteristics of the plurality of computing environments.

In any embodiments, any or all of the following features may be included in any combination and without limitation. The plurality of computing environments may include a cloud computing environment. The plurality of computing environments may include an on-premise computing environment. At least one of the plurality of containerized micro services may include a plurality of containers in a single container pod. At least one of the plurality of containerized micro services may include a single container comprising a plurality of micro-services in a single container pod. The deployment criteria may include an indication that the service deployment should be optimized for security. The deployment criteria may include an indication that the service deployment should be optimized for availability. The deployment criteria may include an indication that the service deployment should be optimized for cost. The deployment criteria may include an indication that the service deployment should be optimized for latency. The deployment criteria may include an indication that the service deployment should be optimized for co-location of the containerized micro services. Each of the plurality of containerized micro services may include an attribute corresponding to the deployment criteria. Values for the micro service attributes may be matched with the characteristics of the plurality of computing environments. The plurality of containerized micro services may be deployed by a container platform scheduler. The plurality of containerized micro services may be deployed by an API registry. The API registry may be deployed as a service encapsulated in a container in the container environment. The API registry may be available to services in development in an Integrated Development Environment (IDE) and services already deployed in the container environment. The API registry may map service endpoints for the plurality of container pods to one or more API functions. The API registry may automatically generate client libraries for calling services to call the service.

BRIEF DESCRIPTION OF THE DRAWINGS

A further understanding of the nature and advantages of the present invention may be realized by reference to the remaining portions of the specification and the drawings, wherein like reference numerals are used throughout the several drawings to refer to similar components. In some instances, a sub-label is associated with a reference numeral to denote one of multiple similar components. When reference is made to a reference numeral without specification to an existing sub-label, it is intended to refer to all such multiple similar components.

FIG. 1 illustrates a software structure and logical arrangement of development and runtime environments for services in a container platform, according to some embodiments.

FIG. 2 illustrates a specialized computer hardware system that is specifically designed to run the embodiments described herein.

FIG. 3 illustrates a data organization that may be specific to the container platform used by some of the embodiments described herein.

FIG. 4 illustrates an API registry that can be deployed to the IDE and the production/runtime environment, according to some embodiments.

FIG. 5 illustrates the deployment of the API registry for use with the container platform at runtime, according to some embodiments.

FIG. 6A illustrates a flowchart of a method for deploying the API registry, according to some embodiments.

FIG. 6B illustrates a software structure of a container platform when the API registry is deployed using the flowchart in FIG. 6A, according to some embodiments.

FIG. 7A illustrates a flowchart of a method for registering a service with the API registry, according to some embodiments.

FIG. 7B illustrates a hardware/software diagram of the steps for registering an API with the API registry, according to some embodiments.

FIG. 8 illustrates examples of a graphical interface and a command line interface for browsing and selecting APIs that are registered with the API registry, according to some embodiments.

FIG. 9 illustrates a flowchart of a method for using a service and its corresponding function registered with the API registry, according to some embodiments.

FIG. 10 illustrates how a selection may be received by the API registry through the graphical interface of the CreateUser( ) function.

FIG. 11 illustrates an example of a client library generated automatically for a service by the API registry, according to some embodiments.

FIG. 12 illustrates an embodiment of a client library that accommodates dynamic binding between service endpoints and API functions, according to some embodiments.

FIG. 13 illustrates an embodiment of a client library that can marshal additional data to complete an input data set for a service call, according to some embodiments.

FIG. 14 illustrates a client library that can handle retries when calling a service, according to some embodiments.

FIG. 15A illustrates a method of providing API properties to the API registry, according to some embodiments.

FIG. 15B illustrates a hardware/software diagram of how a service can provide API properties to the API registry, according to some embodiments.

FIG. 16 illustrates a hardware/software diagram where a property is used by the API registry to deploy a service with high availability, according to some embodiments.

FIG. 17 illustrates a hardware/software diagram of a property that enforces end-to-end encryption through the API registry, according to some embodiments.

FIG. 18 illustrates a property for an API registry to implement usage logging for a service 1808, according to some embodiments.

FIG. 19 illustrates a hardware/software diagram of a property that can enforce an authentication protocol for a service, according to some embodiments.

FIG. 20 illustrates a hardware/software diagram for a property that enables runtime instantiation of a service, according to some embodiments.

FIG. 21 illustrates a hardware/software diagram of a property that implements a rate limiting function for a service, according to some embodiments.

FIG. 22 illustrates an example of a service that is built from a plurality of micro-services, according to some embodiments.

FIG. 23 illustrates a deployment environment for the service, according to some embodiments.

FIG. 24 illustrates a flowchart of a method for optimizing service deployment across a plurality of computing environments, according to some embodiments.

FIG. 25 illustrates a diagram of service criteria and environment/microservice attributes, according to some embodiments.

FIG. 26 illustrates an example of a service deployment that is optimized to minimize operating costs, according to some embodiments.

FIG. 27 illustrates an example of a service deployment that is optimized for service availability, according to some embodiments.

FIG. 28 illustrates an example of a service deployment that is optimized for security concerns, according to some embodiments.

FIG. 29 illustrates an example of a service deployment that is optimized for processing speed, according to some embodiments.

FIG. 30 illustrates an example of a service deployment that is optimized according to multiple criteria simultaneously, according to some embodiments.

FIG. 31 illustrates a simplified block diagram of a distributed system for implementing some of the embodiments.

FIG. 32 illustrates a simplified block diagram of components of a system environment by which services provided by the components of an embodiment system may be offered as cloud services.

FIG. 33 illustrates an exemplary computer system, in which various embodiments may be implemented.

DETAILED DESCRIPTION

Described herein, are embodiments for an Application Programming Interface (API) registry that is part of an Integrated Development Environment (IDE) that allows developers to register services during development and make those services available to other services both during and after deployment. The API registry can be deployed as part of an orchestrated container platform, operating as a containerized application on the container platform. As services or microservices are developed and deployed into containers on the container platform, the API registry can execute a discovery process to locate available endpoints (e.g., IP addresses and port numbers) within the container platform that correspond to available services. The API registry can also accept an upload of an API definition file that can be used to turn the raw service endpoint into an API function made available through the API registry. The API registry can dynamically bind the discovered endpoint to an API function that be kept up-to-date and made available to other services in the container platform. This provides a stable endpoint that other services can statically call while the API registry manages any changes to the binding between the API function in the service endpoint. This also simplifies the process for using services in the container platform. Instead of writing code for an HTTP call, new services can simply use the API interface to access registered services.

In some embodiments, the IDE can provide a navigation/browse interface for developers to locate services that are available in the container platform and registered with the API registry. When calls to existing services are created by the API registry for new services under development, the API registry can automatically generate a set of client libraries that include all the necessary functionality to interact with the registered service. For example, some embodiments may generate an object class that includes member functions corresponding to API calls. During development, new services can simply instantiate these objects and/or use their member functions to make a call to the corresponding API. The code in the client libraries governs a direct connection between the calling service and the endpoint of the registered service and may include code that handles all the functionality necessary for this interaction. For example, the automatically generated client libraries may include: code for packaging and formatting parameters from the API call into an HTTP call to the service endpoint, code for marshaling data to complete parameter sets for the call, code for packaging information into a compatible packet (JSON, XML, etc.), code for receiving and parsing result packets, code for handling retries and error conditions, and so forth. From the calling service's perspective, the code to handle all of this functionality is automatically generated by the API registry and therefore abstracts and encapsulates the details of the service call into the client library object. All that is required of the calling service is to execute a member function of the client library object created by the API registry.

In some embodiments, the API registry can also accept an upload of a set of properties that may define the runtime execution of the registered service. This set of properties can be uploaded during development along with the API definition file. These properties can define runtime characteristics, such as end-to-end encryption, usage/logging requirements, user authentication, on-demand service instantiation, multiple service deployment instances for high availability, rate/usage limiting, and other runtime characteristics. The API registry can ensure that these properties are met by interacting with the container environment during development, during deployment, and during runtime. During development, the automatically generated client libraries for calling services can include code that may be required to execute these properties, such as encryption code, usage logging code, and/or interaction with a user authentication service. When a registered service is being deployed, the API registry can instruct the container platform to instantiate multiple instances of the service and/or additional load-balancing modules to ensure high reliability of the service during runtime. During runtime when a service is called, the API registry can cause the service to be instantiated for on-demand instantiation, limit the number of API calls that can be made to throttle usage, and perform other runtime functions.

FIG. 1 illustrates a software structure and logical arrangement of development and runtime environments for services in a container platform, according to some embodiments. The environments may include an IDE 102 that may be used to develop services and microservices to be deployed on a container platform. An IDE is a software suite that consolidates and provides all of the basic tools that service developers can use to write and test new services. The IDE 102 may include a source code editor 106 with a graphical user interface (GUI), code completion functions, and navigate/browse interfaces that allow a developer to write, navigate, integrate, and visualize the source-code-writing process. The IDE 102 may also include a debugger 110 that includes variable interfaces, immediate variable interfaces, expression evaluation interfaces, memory content interfaces, breakpoint visualization and functionality, and other debugging functions. The IDE 102 may also include a compiler and/or interpreter 108 for compiling and running compiled machine code or interpreted byte code. The compiler/interpreter 108 can include build tools that allow developers to use/generate makefiles another build automation constructs. Some embodiments of the IDE 102 may include code libraries 112 that include common code functions, objects, interfaces, and/or other structures that can be linked into a service under development and reused across multiple developments.

Services can be developed and thoroughly tested within the IDE 102 until they are ready for deployment. The services can then be deployed to a production/deployment environment 104. The production/development environment 104 may include many different hardware and/or software structures, including dedicated hardware, virtual machines, and containerized platforms. Prior to this disclosure, when a service 114 was deployed into the production/deployment environment 104, the service 114 would no longer have runtime access to many of the tools used in the IDE 102. Any functionality needed by the service 114 to run in the production/development environment 104 needed to be packaged from the code libraries 112 and deployed with the service 114 into the production/deployment environment 104. Additionally, the service 114 would typically be deployed without any of the functionality for the debugger 110 or a copy of the source code from the source code editor 106. Essentially, the service 114 would be deployed to the production/deployment environment 104 with all of the functionality required for runtime operation, but would be stripped of the information that was only used during development.

FIG. 2 illustrates a specialized computer hardware system that is specifically designed to run the embodiments described herein. By way of example, the service 114 can be deployed into an Infrastructure as a Service (IaaS) cloud computing environment 202. This is a form of cloud computing that provides virtualized or shared computing resources over a network. The IaaS cloud computing environment 202 may also include or be coupled with other cloud computing environments arranged as Software as a Service (SaaS) and/or Platform as a Service (PaaS) architetures. In this environment, the cloud provider can host an infrastructure of hardware and/or software components that were traditionally present in an on-premises data center. This hardware may include servers, storage, networking hardware, disk arrays, software libraries, and virtualization utilities such as a hypervisor layer. The IaaS environment 202 can be provided by a commercial source, such as Oracle® or other publicly available cloud platforms. The IaaS environment 202 may also be deployed as a private cloud using a private infrastructure of hardware and software.

Regardless of the type of cloud environment, the service 114 can be deployed onto a number of different types of hardware/software systems. For example, the service 114 can be deployed to dedicated hardware 206. The dedicated hardware 206 may include hardware resources, such as servers, disks, operating systems, software packages, and so forth, that are specifically assigned to the service 114. For example, a specific server may be allocated to handle traffic flowing to and from the service 114.

In another example, the service 114 can be deployed to hardware/software that is operated as one or more virtual machines 208. A virtual machine is an emulation of a computer system that provides the functionality of the dedicated computer hardware 206. However, instead of being dedicated to a specific function, the physical hardware can be shared by number of different virtual machines. Each virtual machine can provide all the functionality needed to execute including a complete operating system. This allows virtual machines having different operating systems to run on the same physical hardware and allows multiple services to share a single piece of hardware.

In a another example, the service 114 can be deployed to a container platform 210. The container platform differs from the virtual machines 208 in a number of important ways. First, the container platform 210 packages individual services into containers as described in greater detail below in FIG. 3. Each container shares a host operating system kernel, and they also share binaries, libraries, and other read-only components. This allows containers to be exceptionally light—often only a few megabytes in size. Additionally, a lightweight container is very efficient, taking just seconds to start versus the minutes required to boot up a virtual machine. Containers also reduce management overhead by sharing the operating system and other libraries that can be maintained together for the entire set of containers in the container platform 210. Even though containers share the same operating system, they provide an isolated platform, as the operating system provides virtual-memory support for isolation. Container technologies may include Docker® containers, the Linux Libcontainer®, the Open Container Initiative (OCI), Kubernetes®, CoeOS, Apache® Mesos, along with others. These containers can be deployed to a container orchestration platform, which may be referred to herein as simply the “container platform” 210. A container platform manages the automated arrangement, coordination, and management of deployed software containers. The container platform 210 can provide service discovery, load-balancing, health checks, multiple deployments, and so forth. The container platform 210 may be implemented by any publicly available container platform, such as Kubernetes, that runs containers organized in nodes and pods.

Regardless of the platform 206, 208, 210 on which the service 114 is deployed, each of the platforms 206, 208, 210 can provide service endpoints 212, 214, 216 that provide public access for calling the service 114. Generally, these endpoints can be accessed through an HTTP call and are associated with an IP address and a port number. By connecting to the correct IP address and port number, other services can call services deployed to any of the platforms 206, 208, 210 when they are made publicly available. Each service, such as service 114, may include its own proprietary formats and data requirements for calling the service. Similarly, each service may return results that are specific in format and data type to that service 114. In addition to the service-specific requirements, the particular deployment platform 206, 208, 210 may also include additional requirements for interacting with the service 114, such as programming languages, package formats (JSON, XML, etc.) that need to be complied with to properly interact with the service, and so forth.

Although the examples above allow the service 114 to be deployed to any of the described platforms 206, 208, 210, the embodiments described herein are specifically designed for the container platform 210 described above. Thus, embodiments that are specifically recited to be deployed in a “container platform” can be distinguished from other embodiments that are specifically recited to be deployed in a virtual machine platform, on the server or dedicated hardware platform, or generally in an IaaS environment.

FIG. 3 illustrates a data organization that may be specific to the container platform 210 used by some of the embodiments described herein. Generally, any deployment of a service to the container platform will be deployed to a pod 304, 306. A pod is an abstraction that represents a group of one or more application containers (e.g., Docker or rkt). A pod may also include some shared resources that are commonly available to all of the containers within the pod. For example, pod 304 includes container 310 and container 312. Pod 304 also includes a shared resource 308. The resource may include a storage volume or other information about how containers are run or connected within the pod 304. The pod 304 can model an application-specific logical host that contains different service containers 310, 312 that are relatively tightly coupled. For example, service 326 in container 310 can utilize the resource 308 and call service 320 in container 312. Service 320 can also call service 322, which in turn calls service 324, each of which are deployed to container 312. The output of service 324 can be provided to a network IP address and port 318, which is another common resource shared by the pod 304. Thus, the services 320, 322, 324, 326 all work together with the shared resource 308 to provide a single service that can be accessed by the IP address and port number 318 by services run in other containers. The service can also be accessed through the IP address and port 318 by computer systems that are external to the container platform, such as a workstation, a laptop computer, a smart phone, or other computing device that is not part of the container platform or IaaS environment.

In the simplest deployment, each container may include a single service, and each pod may include a single container that encapsulates the service. For example, pod 306 includes only a single container 314 with a single service 328. The single service is accessible through the IP address and port number 316 of the pod 306. Typically, when a service is deployed to the container platform, a container and a pod will be instantiated to hold the service. A number of different pods can be deployed to a container node 302. Generally, pods run within nodes. A node represents a worker machine (either virtual or physical) in the container platform. Each node is managed by a “master” that automatically handles scheduling pods within each of the nodes. Each node can run a process that is responsible for communication between the master and the node and for managing the pods in containers on the machine represented by the node. Each node may also include a container runtime responsible for pulling a container image from a registry, unpacking the container, and running the service.

FIG. 4 illustrates an API registry 404 that can be deployed to the IDE 102 and the production/runtime environment 104, according to some embodiments. As described above, a technical problem exists wherein when the service 114 is deployed from the IDE 102 to the production/deployment environment 104, the service 114 loses runtime access to information that is exclusively available in the IDE 102. The API registry 404 is accessible by the service 114 while it is deployed and operating during runtime in the production/development environment 104. The previous technical problem that isolated development functions from runtime functions is overcome by the API registry 404 by the registration of services with the API registry 404 during development and providing an API definition and/or API properties to the API registry 404. The information defining the API can be used by new services in development in the IDE 102 as well as services that have been deployed to the production/deployment environment 104. After this registration process is complete, the service 114 can operate using client libraries that access the API registry 404 during runtime to ensure that the API functions are correctly bound to the current IP address and port number of the corresponding service. The API registry 404 represents a new data structure and processing unit that was specifically designed to solve these technical problems.

Another technical problem that existed in the art was implementing service properties as they are deployed to the production/development environment 104. For example, if a service was to be deployed with high availability, the developer would need to build container deployment files that specifically instantiated multiple instances of the service in the container platform and balanced traffic in such a way that the service was always available. Service developers did not always have this expertise, nor were they often able to manage the deployment of their service. As described below, the API registry 404 allows a service to simply select properties, such as high availability, that can then be implemented automatically by the API registry 404. This technical solution is possible because the API registry 404 bridges the gap between the IDE 102 and the production/deployment environment 104.

FIG. 5 illustrates the deployment of the API registry 404 for use with the container platform 210 at runtime, according to some embodiments. One of the technical solutions and improvements to the existing technology offered by the API registry 404 is the maintenance of stable endpoints for service calls, as well as the simplification and automatic code generation for accessing the service calls. Prior to this disclosure, calls between services were point-to-point connections using, for example, an HTTP call to an IP address and port number. As services are updated, replaced, relocated, and redeployed in the container platform 210, the IP address and port number may change frequently. This required all services that called an updated service to update their IP address and port numbers in the actual code that called that service. The API registry 404 solves this technical problem by providing a dynamic binding between the IP address and port number of a service and an API function that is made available through the API registry. The client libraries that are automatically generated by the API registry 404 can include a function that accesses the API registry 404 to retrieve and/or verify a current IP address and port number for a particular service. Thus, a first service connecting to a second service need only perform a one-time generation of a client library to provide a lifetime-stable connection to the second service.

Another technical problem solved by the API registry 404 is the automatic generation of client libraries. Prior to this disclosure, a first service accessing a second service required the developer to write custom code for accessing the second service. Because this code could change over time, incompatibilities would a rise between the first and second services that required updates to both services. The API registry 404 solves this technical problem by uploading an API definition file that is used to automatically generate client libraries for calling services. Therefore, a service can specify specifically how the calling code in any other service should operate, which guarantees compatibility. These client libraries also greatly simplify and encapsulate the code for calling the service. As described below, a complicated HTTP call using IP address and a port numbers can be replaced with a simple member function call in a language that is specific to the calling service (e.g., Java, C#, etc.). This allows a calling service to select an API function from the API registry 404, and the code that implements at function can be downloaded to the calling service as a client library.

FIG. 6A illustrates a flowchart of a method for deploying the API registry 404, according to some embodiments. The method may include deploying the API registry service to the container environment (601). The API registry can be implemented as a service operating in the container environment within the container. Thus, the API registry can be actively running after services are deployed within the container environment such that it can be accessed at run time. The API registry can also be linked to the existing IDE described above. The method may further include discovering ports for available services in the container platform (603). As services are deployed to the container platform, the API registry can launch a discovery process that sequentially traverses each of the services deployed to the container platform. For each service, the API registry can detect and record an IP address and a port number. The listing of IP address and port numbers discovered by this process can be stored in a data structure, such as a table associated with the API registry. Each IP address and port number can also be stored with a name for the service or other identifier that uniquely identifies the service on the container platform. These initial steps shown in flowchart in FIG. 6A provide a starting point for the API registry to begin operating in the runtime environment of the container platform and to be available to services under development in the IDE.

FIG. 6B illustrates a software structure of the container platform 210 when the API registry is deployed using the flowchart in FIG. 6A, according to some embodiments. As described above, the API registry 404 can be deployed to a container 620 in the container platform 210. The container 620 can operate within one or more pods and within a node as described above in FIG. 3. The API registry 404 can be made privately available to any of the other containers in the container platform 210. In some embodiments, the API registry 404 can also be made publicly available to other devices that are not part of the container platform 210. As a containerized service, the API registry 404 may have an IP address and port number that are available to other services. However, the IP address and port number of the API registry 404 would only be used by the code that is automatically generated in client libraries, therefore some embodiments do not need to publish the IP address and port number for the API registry 404. Instead, the client libraries in the IDE itself can maintain an up-to-date listing of the IP address and port number for the API registry 404 such that it can be contacted during development, deployment, and runtime of other services.

After deploying the API registry 404 to the container 620, the API registry 404 can execute a discovery process. The discovery process can use a directory listing for nodes in the container platform to identify pods that implement services with an IP address and port number. The API registry 404 can then access a unique identifier, such as a number or name for each available service, and store an identifier with each IP address and port number in the container platform 210. This discovery process can be periodically executed to detect new services that are added to the container platform 210, as well as to identify existing services that are removed from the container platform 210. As described below, this discovery process can also be used to detect when an IP address and port number change for an existing service. For example, the API registry 404 can discover services having endpoints 602, 604, 606, 608. In the process described below, the API registry 404 can bind each of these endpoints 602, 604, 606, 608 to an API function that is registered with the API registry 404. At some point after this initial discovery, the IP address and/or port number for endpoint 602 may be changed when the service associated with endpoint 602 is replaced, updated, or revised. The API registry 404 can detect this change to endpoint 602 and update a binding to an existing API function provided by the API registry 44.

Similarly, the API registry 404 can use the discovery process to detect when endpoints are no longer available, and then remove the API functions associated with the service. In some embodiments, when a service has been registered with the API registry 404, but the corresponding API functions are not currently bound to a valid endpoint, the API registry 404 can provide a mock response to any service calling the corresponding API functions. For example, if an API has been registered for the service corresponding to endpoint 604, but endpoint 604 is not currently available, the API registry 404 can intercept a call made to endpoint 604 and provide default or dummy data in response. This allows services that call the service associated with endpoint 604 to maintain functionality and/or continue the design process without “breaking” the connection to this particular service. Mock/testing data scenarios will be described in greater detail below.

FIG. 7A illustrates a flowchart of a method for registering a service with the API registry 404, according to some embodiments. The method may include receiving an upload of an API definition (701). The API definition may be provided in the form of a data packet, file, or a link to an information repository. The API definition may include any information that can be used to identify and define API functions that should be bound to endpoints associated with the service. For example, some embodiments of the API definition may include the following data: a service name or other unique identifier; function names corresponding to service endpoints and calls, data inputs required to call the service with corresponding descriptions and data types; result data formats and data types; a current IP address and/or port number; documentation that describes the functionality of the API functions that will be associated with the endpoint; default or dummy data values that should be returned during mock/test scenarios; and any other information that may be used by the API registry 404 to translate the HTTP request received by the endpoint into a client library that uses API function calls of class data objects.

The method may also include creating corresponding API functions based on the uploaded API definitions (703). These API functions can be generated automatically based on the API definition. Each endpoint for a service may be associated with a plurality of different API functions. For example, an endpoint implementing a RESTful interface may receive HTTP calls for POST, GET, PUT, and DELETE functions at the same IP address and port number. This may result in, for example, for different API functions. For example, if the interface represents a list of users, this can correspond to at least four different API functions, such as GetUser( ) AddUser( ) RemoveUser( ) and UpdateUser( ) Additionally, each API function may include a number of different parameter lists, such as UpdateUser(id), UpdateUser(name), UpdateUser(firstname, lastname), and so forth. These API functions can be generated and made available to other services through the API registry. As will be described in greater detail below, it should be noted that services are not required to call these functions through the API registry. Instead, these functions are made available to browse in the API registry, and when selected, the API registry can generate client libraries that implement these functions in the calling service.

The method may additionally include creating a binding in the API registry between the API function and the corresponding endpoint of the service (705). Based on the discovery process described above and the registration process of steps 701, the API registry can now create a dynamic binding between an endpoint for a service in the container platform and the API function created by the API registry. In the data structure formed above when discovering available endpoints and services, the API registry can now store a corresponding function or set of functions for each endpoint. As described above, this binding can be constantly updated as the discovery process determines when services are updated, moved, replaced, or added to the container platform. This allows the client libraries created in a calling service to first check with the API registry to verify or receive a current IP address and port number for the service.

FIG. 7B illustrates a hardware/software diagram of the steps for registering an API with the API registry 404, according to some embodiments. As described above, the API registry 404 can be instantiated and running in a container 620 in the container platform 210. Even though the container platform 210 represents a production/deployment environment, the API registry 404 can still be accessed by the IDE 102 used to develop the service. Thus, the IDE 102 can provide a mechanism for uploading the API definition files 702 to the API registry 404. Specifically, the user interface of the IDE 102 may include a window or interface that allows the developer to define and/or populate fields for the API definition files 702. This information described above may include function names, parameter lists, data types, field lengths, object class definitions, an IP address and port number, a service name or other unique identifier, and so forth. This information can be uploaded to the API registry 404 and linked in a dynamic binding to a particular IP address and port number for the endpoint 602. Finally, the API registry 404 can generate one or more API functions 704 that can be made available through the API registry 404.

After registering a service with the API registry 404 and generating one or more API functions, the API registry can then make those functions available for developers as they design services. FIG. 8 illustrates examples of a graphical interface 802 and a command line interface 804 for browsing and selecting APIs that are registered with the API registry 804, according to some embodiments. When programming and developing a new service for the container platform, the developer can access the graphical interface 802 to browse and select API functions that can be used in their service. This graphical interface 802 is merely an example and not meant to be limiting of the types of graphical interfaces that can be used to browse and select API functions.

In this embodiment, the IDE 102 can summon the graphical interface 802 to provide a list of APIs that are registered with the API registry. In this embodiment, the APIs are categorized based on endpoint. For example, one endpoint corresponding to a service may offer a RESTful interface for storing user records (e.g., “UserStorage”). The graphical interface 802 can display all of the API functions (e.g., “CreateUser”, “DeleteUser”, “UpdateUser”, etc.) that are available through the selected endpoint. Other embodiments may group functions based on the overall service in cases where the service offers multiple endpoints. The graphical interface 802 can receive a selection of one or more API functions to be used in a calling the service. The API registry can then provide documentation that illustrates how to use the API function, including required parameters and return values. One having ordinary skill in the art will understand that the command line interface 804 can provide similar information and can receive similar inputs as the graphical interface 802.

The interfaces 802, 804 illustrated in FIG. 8 provide a number of technical benefits. First, these interfaces 802, 804 provide an up-to-date listing of all APIs that are registered with the API registry. This corresponds to a list of all services currently available in the container platform. Instead of being required to look up documentation, contact a service developer, and/or perform other inefficient tasks for locating a list of available services, a service developer can retrieve and display this information in real-time. Additionally, as services are updated, the API definition files can be updated in a corresponding fashion. This then updates the display illustrated in FIG. 8 to provide up-to-date availability information for each API function.

FIG. 9 illustrates a flowchart of a method for using a service and its corresponding function registered with the API registry, according to some embodiments. The method may include providing a listing of registered APIs (901). This step may be omitted in cases where the desired service is already known. However, generally the services can be displayed for browsing and navigation using the interfaces described above in FIG. 8. The method may also include receiving a selection of an API function (901). This selection may be received by the API registry from a developer of the service. For example, a developer may decide to update a database of user records using the CreateUser( ) function described above. FIG. 10 illustrates how a selection 1002 may be received by the API registry through the graphical interface 802 for the CreateUser( ) function. Other embodiments may receive the selection through the command line interface or through other input methods provided by the IDE.

Referring back to FIG. 9, once the selection of an API function is received, the API registry can generate one or more client libraries for the calling service (905). Generating client libraries may provide the calling service with the service endpoint that is dynamically bound to the API function. Specifically, the IDE can generate a set of class objects in the IDE that encapsulate the functionality required to interface directly with the service endpoint in the container platform. In some embodiments, client libraries may include object classes that can be instantiated or used to call member functions that embody the code required to communicate with the service. Examples of these client libraries will be described in greater detail below.

The method may additionally include providing test data (907). When a service is registered with the API registry, it need not be complete. Instead, the service can indicate to the API registry that it is not yet ready to provide functional responses to calling services. In some embodiments, the API definition file that is uploaded to the API registry can include a specification of the type of information that should be returned before the service is functional. When the calling service calls the API function, the client library generated by the API registry can route requests to the API registry instead of the service endpoint. The API registry can then provide a response using dummy, null, or default values. Alternatively, the code within the client libraries themselves can generate the default data to be returned to the calling service.

It should be appreciated that the specific steps illustrated in FIG. 9 provide particular methods of using an API registry according to various embodiments of the present invention. Other sequences of steps may also be performed according to alternative embodiments. For example, alternative embodiments of the present invention may perform the steps outlined above in a different order. Moreover, the individual steps illustrated in FIG. 9 may include multiple sub-steps that may be performed in various sequences as appropriate to the individual step. Furthermore, additional steps may be added or removed depending on the particular applications. One of ordinary skill in the art would recognize many variations, modifications, and alternatives.

FIG. 11 illustrates an example of a client library generated for a service automatically by the API registry, according to some embodiments. This client library 1102 may correspond to a service that stores user records. This client library 1102 and the corresponding class and service are provided merely by way of example and not meant to be limiting. As described above, each API function and service can specify how client libraries should be generated by virtue of the API definition file uploaded to the API registry. Therefore, the principles described below in relation to the “User” service may be applied to other services.

To represent the User service, the API registry can generate a class for a User. When the calling service requests client libraries to be generated by the API registry, the calling service can specify a programming language being used by the calling service. For example, if the calling service is being written in Java in the IDE, then the API registry can generate class libraries in the Java programming language. Alternatively, if the calling service is being written in C#, then the API registry can generate class libraries in the C# programming language. The User class can be generated to have member functions that correspond to different operations that may be performed through the service endpoint. These member functions can be static such that they do not require an instantiated instance of the User class, or they may be used with instantiated User objects.

In this example, the User service may use a RESTful interface to edit individual user records that are stored by the service. For example, the API registry can generate the CreateUser( ) function to implement a POST call to the User service. One of the functions that can be performed by the class library is to parse, filter, and format data provided as parameters to the API function to be sent as a data packet directly to the service. In this example, the CreateUser( ) function can accept parameters that are formatted for the convenience of the calling service. For example, the calling service may separately store strings for the user first name and the user last. However, the POST command may require a concatenated string of the first name in the last name together. In order to accommodate a user-friendly set of parameters, the client library 1102 can perform a set operations that format the data received as parameters to the function into a format that is compatible with the service endpoint. This may include generating header information, altering the format of certain data fields, concatenating data fields, requesting additional data from other sources, performing calculations or data transforms, and so forth. This may also include packaging the reformatted parameters into a format, such as JSON, XML, etc.

Once the parameters are correctly formatted into a package for the service endpoint, the client library 1102 can also handle the POST call to the service. When the client library is generated, the IP address and port number for the service can be inserted into the CreateUser( ) function to be used in an HTTP request to the service. Note that the details of the HTTP request are encapsulated in the CreateUser( ) function. When a developer for a calling service wants to use the POST function made available by the service, instead of writing the code in the library 1102 themselves, they can instead select the User service from the API registry. The API registry will then automatically generate the client library 1102 that includes the User class. Then, to use the POST function, the service developer can simply use the User.CreateUser(“John”, “Smith”, 2112) function to add the user John Smith to the service.

FIG. 12 illustrates an embodiment of a client library 1202 that accommodates dynamic binding between service endpoints and API functions, according to some embodiments. In this example, when the API registry generates the client library 1202, the CreateUser( ) function can include code 1204 that dynamically retrieves the IP address and port number for the service. The calling service 114 can use the GetIPPort( ) function to send a request to the API registry 404 at run time when the calling service 114 is operating in the production/deployment environment 104, such as the container platform. The API registry 404 can access its internal table that is consistently updated to maintain up-to-date bindings between the API functions and the service endpoints. The API registry 404 can then return a current IP address and port number to the calling service 114. The client library 1202 can then insert the IP address and port number into the HTTP POST code that connects to the service. Because the API registry 404 can be accessed at run time by any calling service in the container platform, none of these services need to be updated or patched when the IP address for port number for the service being called changes. Instead, the API registry 404 can provide up-to-date information every time a service is called. In some embodiments, the GetIPPort( ) function may only need to call the API registry 404 once an hour, once a day, once a week, and so forth, to minimize the number of function calls made outside of the container for the service 114 under the assumption that the service endpoints do not change frequently in the production environment.

FIG. 13 illustrates an embodiment of a client library 1302 that can marshal additional data to complete an input data set for a service call, according to some embodiments. To simplify using the client library 1302, the client library 1302 may minimize the number of parameters required from the service developer. Additional data that may be required to make the service call can be retrieved from other sources and thus may be omitted from the parameter list. These additional parameters can instead be retrieved directly by the client library 1302 from these other sources. For example, creating a new user may include specifying a user role for the user. Instead of requiring the service developer to provide a user role as one of the parameters, the client library 1302 can instead include code 1304 that automatically retrieves a role for the user from some other source. In this example, the user role can be retrieved from a database, from another service in the container platform, or from another class storing user roles within the calling service. In any of these cases, the code 1304 can automatically retrieve the user role and package it as part of the input data for the HTTP POST command sent to the service.

In addition to marshaling and formatting data for inputs to the service, the client library 1302 can also parse and return data received from the service and handle error conditions. In this example, the POST command may return a data packet into the Result variable. Often times, a service may return a data packet that includes more information than the calling service needs. Therefore, the client library 1302 can parse the data fields in the Result variable and extract, format, and package data from the Result variable into a format that is more usable and expected by the User class. In this example, the code 1306 can extract fields from the Result variable and use them to create a new User object that is returned from the API function. In another example using a GET command, individual API functions can be generated in the User class that extract different fields from the Result variable from the GET command. For example, the User class could provide a GetFirstName(id) function, a GetLastName(id) function, a GetRole(id) function, and so forth. Each of these functions may include very similar code while returning different fields from the Result variable.

In addition to parsing results, the client library 1302 may also generate code 1308 that handles error conditions associated with using the service. In this example, the code 1308 can test a status field in the Result variable to determine whether the POST command was successful. If the command was successful, then the CreateUser( ) function can return a new User object. In cases where the Post command failed, the function can instead return a null object and/or retry the call to the service.

FIG. 14 illustrates a client library 1402 that can handle retries when calling a service, according to some embodiments. Like the example of FIG. 13, the client library 1402 uses a status in a Result variable populated by the POST HTTP call to determine whether the call was successful or not. While the result is unsuccessful, the client library 1402 can continue to retry until the call is successful. Some embodiments may use a counter or other mechanism to limit the number of retries or add a wait time between retries.

As described above, some embodiments may also upload a set of API properties to the API registry along with the API definition. FIG. 15A illustrates a method of providing API properties to the API registry, according to some embodiments. The method may include receiving an upload of an API definition (1501). The method may also include receiving an upload of API properties (1503). The upload of properties may be part of the same transmission as the upload of the API definition. In some embodiments, the API properties may be part of the API definition. In some embodiments, the API properties may be one or more flags or predefined data fields that are checked to indicate that property should be set by the API registry. In some embodiments, the API properties need not conform to any pre-structured format, but can instead be represented by instruction code that causes the API registry to implement the features described below, such as authentication, encryption, and so forth. The API properties can be stored along with the API definition for each service.

The method may additionally include creating the API binding between the service and the API (1505). This operation may be performed as described in detail above. Additionally, the method may include using the API properties to perform one or more operations associated with the service (1507). The API properties may be used at different phases during the lifecycle of the service. Generally, this may be described as using the API properties to implement a function associated with the property during the deployment of a service, when generating client libraries for service, and/or when calling service. Examples of each of these functions will be described below in greater detail.

It should be appreciated that the specific steps illustrated in FIG. 15A provide particular methods of providing API properties to an API registry according to various embodiments of the present invention. Other sequences of steps may also be performed according to alternative embodiments. For example, alternative embodiments of the present invention may perform the steps outlined above in a different order. Moreover, the individual steps illustrated in FIG. 15A may include multiple sub-steps that may be performed in various sequences as appropriate to the individual step. Furthermore, additional steps may be added or removed depending on the particular applications. One of ordinary skill in the art would recognize many variations, modifications, and alternatives.

FIG. 15B illustrates a hardware/software diagram of how a service can provide API properties to the API registry, according to some embodiments. While developing a service in the IDE 102, a service developer can provide the API definition file 1502 and one or more properties 1504 to the API registry 404. Because the API registry 404 is accessible in both the IDE 102 and the container platform at runtime, the API registry 404 can store the properties 1504 and use them to affect how a service is deployed, called, and/or used to generate client libraries during both development and runtime scenarios.

FIG. 16 illustrates a hardware/software diagram where a property is used by the API registry to deploy a service with high availability, according to some embodiments. In addition to the API definition file 1505 for a particular service, the API registry 404 may receive a property 1602 indicating that the service should be deployed to be very resilient, or have high availability. This property 1602 may be received as a set of instructions that are executed by the API registry 404 to deploy the service to have high availability. This option allows the developer to define what it means to be “high-availability” for this service. For example, the property 1602 may include instructions that cause the API registry 404 to deploy multiple instances 602, 604 of the service to the container platform 210. By executing these instructions, the API registry 404 does not need to make any decisions or determinations on its own, but can instead simply execute the deployment code provided as part of the property 1602.

The property 1602 may also be received as a flag or setting that indicates to the API registry 404 an option to execute existing instructions at the API registry 404 for deploying the service with high availability. With this option, the API registry 404 need not receive any code to be executed as the property 1602. Rather, the API registry 404 can recognize the high-availability property 1602 and execute code that is maintained in the API registry 404 to deploy multiple instances 602, 604 of the service. This allows the API registry 404 to define what it means to be “high-availability” for the deployment of any service that is registered with the API registry 404.

Because the API registry 404 is connected to the runtime environment of the container platform 210, the API registry 404 can interact with the container platform 210 to deploy the instances 602, 604 that determine the runtime availability of the service. Note that the two instances 602, 604 of the service illustrated in FIG. 16 are provided merely as an example and not meant to be limiting. A high-availability service may include more than two redundant instances of a service being deployed to the container platform.

Some embodiments may include code in the API registry 404 that can be executed as a default. If the property 602 includes only a simple indication that high availability is desired, the API registry 404 can execute its own code. If the property 602 includes deployment code for deploying the service, the API registry 404 can instead execute the code of the property 1602. In some cases, the property 1602 may include only a portion of the code needed to deploy the service with high-availability. The API registry 404 can execute the portions of the code that are provided by the property 1602, then execute any code not provided by the property 1602 using the code at the API registry 404. This allows developers to overwrite an existing definition of how to execute a property, such as high-availability, at the API registry 404, while still allowing the API registry 404 to provide a uniform definition for executing properties that can be used by registered services.

In some embodiments, a high-availability property may also cause the container platform 210 to deploy a load balancing service 606 that distributes requests to the multiple instances 602, 604 of the service. The endpoint of the load balancing service 606 can be registered with the API registry 404 and made available to other services. Alternatively or additionally, each of the multiple instances of the service 602, 604 may be registered with the API registry 404 as service endpoints.

In each of the examples described below, the same principles discussed in relation to FIG. 16 may apply. For example, any property described below may be accompanied with code that may be received by the API registry 404 and used to overrule code that would otherwise be executed by the API registry 404. Prior to this disclosure, no method existed for creating a uniform default for executing properties while simultaneously allowing service developers to overrule those properties if needed. Therefore, the API registry 404 solves a technical problem by allowing code to be executed at the API registry 404 as a default while still allowing that code to be overruled by a property 1602 received from a developer.

FIG. 17 illustrates a hardware/software diagram of a property that enforces end-to-end encryption through the API registry, according to some embodiments. Along with the API definition file 1505, the API registry 404 may receive a property 1704 that indicates, or includes code that generates, end-to-end encryption for calling the service 1708. During development, the service 1708 can include its own decryption/encryption code 1710 that causes packets received by the service 1708 to be decrypted and packets returned by the service 1708 to be encrypted. Prior to this disclosure, the developer would need to provide a specification that indicated users of the service 1708 needed to provide encryption to be compatible with the service 1708. This embodiment solves a technical problem by allowing the service 1708 to dictate how the client libraries are generated in a calling service 1706, which ensures compatibility with the encryption of the service 1708.

In some embodiments, the developer of the service 1708 need not include the encryption/decryption code 1710 in the service 1708. Instead, the property 1704 can simply instruct the API registry 404 to enforce end-to-end encryption for the service 1708. When the service 1708 is deployed to the container platform 210, the API registry 404 can cause the encryption/decryption code 1710 to be inserted into the service 1708 when it is deployed. This allows the developer to select between different encryption regimes based on the property 1704 and/or to allow the API registry 404 to select a preferred encryption regime as a default.

End-to-end encryption requires not only the encryption/decryption code 1710 to be inserted into the service 1708 when it is deployed or during development, but it also requires that a calling service 1706 also includes compatible encryption/decryption code. As described above, when the calling service 1706 needs to use the service 1708, the API registry 404 can generate one or more client libraries 1702 that completely implement the code needed to interact with the service 1708 in a simple and efficient manner. When this client library 1702 is generated, the API registry 404 can analyze the property 1704 to determine an encryption regime used by the service 1708. Then, based on that property 1704, the API registry 404 can cause a compatible encryption/decryption code to be added to the client library 1702 for the calling service 1706. Thus, when the calling service 1706 sends a request to the service 1708, the information may be encrypted at the calling service 1706 and decrypted once received by the service 1708. Similarly, the service 1708 can encrypt a response before it is sent to the calling service 1706, which can then decrypt the response before passing the response outside of the client library 1702. This causes the entire encryption process to be entirely transparent to a developer of the calling service 1706. Instead of being required to implement a compatible encryption/decryption regime when calling the service 1706, the property 1704 may ensure that the API registry 404 has already generated the encryption/decryption code in the client library 1702 to be compatible and implement the end-to-end encryption property.

FIG. 18 illustrates a property 1804 for an API registry to implement usage logging for a service 1808, according to some embodiments. Prior to this disclosure, to monitor and log the frequency, source, success rate, etc., of requests to a service, the service itself had to log this information. Alternatively, the container environment had to monitor the service and log its usage information. Logging information at the service 1808 itself is terribly inefficient, and slows down the throughput for every request handled by the service. Similarly, the overhead of requiring the container platform to monitor and log all the calls made to particular services also represents a tremendous overhead to the scheduling and orchestration of container services. This embodiment solves this technical problem by inserting code directly into client libraries for services that call the service 1808. This allows the usage of the service 1808 to be logged and monitored without affecting the performance of the service 1808 at all in terms of memory usage or CPU usage.

In addition to the API definition file 1505, the API registry 404 can receive a property 1804 that indicates, or includes code that implements, usage logging 1804. When a developer of a calling service 1806 desires to submit requests to the service 1808, the API registry 404 can automatically generate a client library 1802 that includes code for logging activity related to the service 1808. As described above, this code can be generated based on default code maintained and executed by the API registry 404, or can be generated by code received with the property 1804 and executed by the API registry 404.

The code for logging activity in the client library 1802 may include counters that are incremented every time the service 1808 is called, functions that cause activity to be logged to a log file when the service 1808 is called, and other functions that monitor and record characteristics of the requests sent to the service 1808 and responses received from the service 1808. Depending on the particular embodiment, this code may monitor many different types of characteristics associated with requests made of the service 1808. For example, some embodiments may log the total number of calls made to the service 1808. Some embodiments may log a success rate for responses received from the service 1808. Some embodiments may log types of data that are sent in requests to the service 1808. Some embodiments may log times of day or other external information for when the service 1808 is called. Some embodiments may log input and output packets to/from the service 1808 that can be used for debugging the service 1808. Some embodiments may log any or all of these characteristics in any combination and without limitation.

FIG. 19 illustrates a hardware/software diagram of a property 1904 that can enforce an authentication protocol for a service 1908, according to some embodiments. Some services may require that a user identity be authenticated and that the user be authorized to use the service before responding to a request. Prior to this disclosure, a technical problem existed where the authentication and authorization procedures took place at the service 1908 itself. This added overhead in terms of memory usage and CPU usage for every call received by the service 1908, and increased the latency of the service in response. This in turn decreased throughput, and limited the number of requests that could be processed by the service 1908 during any given time interval. These embodiments solve this technical problem by moving authentication/authorization code to the client library 1902 that is automatically generated by the API registry 1404.

When a calling service 1906 wants to use the service 1908, the API registry 404 can generate the client library 1902 that includes code for performing the authorization and/authentication. In some embodiments, this may include contacting external authentication/authorization services 1920 that specifically verify user identities and/or determine whether a user is authorized to use the service 1908. The external authentication/authorization services 1920 may include an access manager, a Lightweight Directory Access Protocol (LDAP) manager, an Access Control List (ACL), a network authentication protocol manager, and so forth. The code in the client library 1902 can then send the call to the service 1908 when the authentication/authorization procedure is successful.

By offloading the authentication/authorization enforcement to the API registry 404 and the client library 1902, this code can be completely eliminated from the service 1908. Because significant delays may often accompany interacting with the external authentication/authorization services 1920, this delay can be removed from the service 1908 to increase throughput. Additionally, rather than hard coding the authentication/authorization enforcement into the service 1908, the developer of the service 1908 can instead simply select a predefined authentication/authorization regime using the property 1904 that is sent to the API registry 404. The API registry 404 can maintain a predefined list of authorization/authentication with the accompanying implementation code for the client library 1902. This also prevents the calling service 1906 from sending requests to the service 1908 that cannot be authorized and/or authenticated. Instead, if the authentication and/or authorization routine is unsuccessful, the call can be aborted at the client library 1902. This ensures that the service 1908 only receives requests that are authenticated and/or authorized.

Another technical improvement provided by the API registry 404 is the ability to upgrade any of the functionality provided by the properties 1904 without being required to change any of the code of any registered services. For example, because the authentication/authorization code has been offloaded to the client library 1902 generated by the API registry 1404, the client library 1902 can be updated to change the authentication/authorization regime. None of the code in the calling service 1906 or the service 1908 needs to be modified. Because code is only changed in a single place, this greatly reduces the probability of code integration errors that would otherwise accompany distributed patches sent out to every individual service.

FIG. 20 illustrates a hardware/software diagram for a property 2004 that enables runtime instantiation of a service, according to some embodiments. Some services may be rarely used or only used during predefined time intervals. Therefore, deploying a service to the container platform need not always result in actually instantiating an instance of the service in a container that is immediately available. In contrast to virtual machines, containers can be instantiated and activated very quickly. Therefore, a service developer may desire to only have the service instantiated when it is called. A service developer may also desire to only have the service instantiated within a predefined time interval. Similarly, the service developer may specify that the service instance should be deleted after a predefined time interval of inactivity.

In addition to receiving the API definition file 1505, the API registry 404 can receive a property 2004 that specifies run-time instantiation or other instantiation parameters. For example, the property may include a specification of one or more time intervals during which the service 2008 should be instantiated after deployment. In another example, the property may include an indication that the service 2008 should only be instantiated on demand. In another example, the property may specify a timeout interval after which the instantiated service 2008 should be deleted from the container platform.

When a calling service 2006 wants to use the service 2008, the API registry 404 can generate code in the client library 2002 that handles the run-time instantiation of the service 2008. For example, the CreateInstance( ) function call in the client library 2002 can create a call to the API registry 404. The API registry can then interact with the container platform 210 to determine whether an operating instance of the service 2008 is available. If not, the API registry 404 can instruct the container platform 210 to instantiate an instance of the service 2008 in a container in the container platform 2010. The container platform 210 can then return the endpoint (e.g., IP address and port number) to the API registry 404. The API registry 404 can then create a binding between that endpoint and the API function call created in the client library 2002. API registry 404 can then return the endpoint to the client library 2002 which can be used to create the direct connection between the calling service 2006 and the newly instantiated service 2008.

For services that should only be instantiated during predefined time intervals, the API registry 404 may establish a table of instantiation and deletion times for certain services. Based on these stored instantiation/deletion times, the API registry 404 can instruct the container platform 210 to instantiate or delete instances of the service 2008. The API registry 404 can also specify a number of instances that should be instantiated during these predefined intervals. For example, from 5:00 PM to 10:00 PM the property 2004 may specify that at least 10 instances of the service 2008 are active on the container platform 210. When this time interval occurs, the API registry 404 can instruct the container platform 210 to create the additional instances.

FIG. 21 illustrates a hardware/software diagram of a property 2104 that implements a rate limiting function for a service 2108, according to some embodiments. Some services may need to limit the rate at which requests are received. Other services may need to limit requests from certain senders or types of services. Prior to this disclosure, this function had to be performed by the service itself by determining a source for each request, comparing the source to a whitelist/blacklist, and throttling the rate at which it serviced these requests. As with most of the examples described above, placing this overhead in the service itself increase the amount of memory and CPU power used by the service and limited the throughput of the service. These embodiments solve this technical problem by automatically generating the rate limiting code in the client library generated by the API registry. This allows the service to specify rate limiting by virtue of the property 2104 without requiring the service 2108 to implement that functionality with all of its associated overhead.

When a calling service 2106 wants to send requests to the service 2108, the API registry 404 can automatically generate the client library 2102 that includes rate limiting code. When the client library 2102 is generated, the API registry 404 can determine whether the particular service 2106 should be rate limited. If not, the client library 2102 can be generated as usual. If the API registry 404 determines that the calling service 2106 should be rate limited (e.g., by comparison to a whitelist/blacklist), then the API registry 404 can insert code in the client library 2102 that adds delays, adds counters, and/or otherwise implements the rate limiting function to ensure that a predefined maximum number of requests are made by the calling service 2106 in any given time interval according to a predefined rate. This code may also implement time windows during which the rate limiting function will be active. This allows the service 2108 to enforce rate limiting during high-traffic intervals automatically.

A microservice architecture may include a plurality of micro-services that can be used to implement larger services. A microservice architecture uses a software design technique that focuses on breaking up large tasks into many fine-grained individual tasks referred to as microservices. For each microservice, the corresponding micro-task to be achieved is simple and the protocol is generally lightweight. This architecture is an evolution from the service-oriented architecture (SOA) architectural style that designed large applications as a collection of small, loosely coupled services. A microservice architecture refines this concept by further distilling each service into a set of simple, complete, and largely independent tasks that can be designed, developed, deployed, and operated alone or in combination with other microservices to implement complex operations. Among the benefits of decomposing an application into microservices is the resulting modularity of simple, easy-to-understand software designs. Because of this modularity, microservice applications are easier to design, test, and maintain, and are more resilient to obsolescence. Microservices can be developed in parallel by small autonomous teams and deployed independently. Microservices can also be continuously re-factored and restructured without changing their external behavior or their operation in a larger application or service framework.

As defined herein, a microservice can be contrasted with a regular service or application by the scope of processing encapsulated by the microservice. Microservices often communicate over a network using technology-agnostic protocols, such as HTTP. Microservices are independently deployable and generally share very little information with other microservices outside of their available interface. In contrast to the overhead of the SOA architecture, microservices can use simple messaging instead of an enterprise service bus (ESB).

Instead of sharing components, microservices aim to minimize sharing through a bounded context that couples microservice code and data as a single unit with minimal dependencies. Finally, the prefix “micro” in microservices makes reference to the small granularity of its internal operations. Microservices are generally single-purpose services that perform a single task as efficiently as possible. Services in general usually include much more business functionality and are often implemented as complete subsystems.

For purposes of this disclosure microservices also lend themselves very well to a containerized environment. Note that containers and microservices are not the same concept; containers are software tools that are well-adapted to organize and develop microservices. The container packages all the software, runtime, system tools, system libraries, settings, and so forth that allow a microservice to be relatively isolated from other software in containers in the same environment while providing everything required to run the microservice. This allows microservices to focus on their task execution rather than system interactions and overhead. Additionally, the container platform environment is particularly well-suited for microservices. For example, the container platform can readily deploy and manage multiple containers of the same type and containers of many different types simultaneously. Essentially, the container platform can handle the allocation and deployment of containerized microservices to fit the available hardware in the platform.

FIG. 22 illustrates an example of a service that is built from a plurality of micro-services, according to some embodiments. A top-level service 2216 may be used to implement a standard business process. The top-level service 2216 may be comprised of a plurality of micro-services that implement single, discrete software steps in the business process. Each of microservices are self-contained processes, and the service 2216 includes all the microservices necessary to implement the service 2216. In this embodiment, the service 2216 can be distinguished from any of the microservices by virtue of the fact that the microservices represent single steps in performing the overall service 2216, and the microservices are logically encapsulated within the service 2216.

One example of such a top-level service 2216 may include a ProcessOrder( ) service for processing a customer order. The business process implemented by the service 2216 may include a number of discrete steps, such as adjusting inventory, generating an invoice, charging a user account, creating a shipping order, updating a ledger, and so forth. Each of these discrete steps may be implemented by microservices that build the service 2216. Each of these microservices may also be deployed into a container within a pod represented by the service 2216. For example, these microservices may include functions such as AdjustInventory( ) CreateShipper( ) Invoice( ) ChargeAccount( ) UpdateLedger( ) EmailReceipt( ) and CreateUserAccount( ) which may be represented by the microservices 2202, 2204, 2206, 2208, 2210, 2212, and 2214. Each of these microservices can make calls to each other within the service 2216. The service 2216 may be represented by a node or pod in the container platform and may also include an endpoint 2218 that is available through an HTTP call for other services in the platform.

In some cases, a microservice may be built from a single process encapsulated in a single container. For example, microservices 2202, 2206, 2208 and, 2212 in FIG. 22 are comprised of single services in single containers. However, other microservices can include a collection of functions that are encapsulated in a single container. For example, microservices 2204 and 2214 include multiple functions in a single container. Furthermore, some microservices, such as microservice 2210 may include multiple functions in multiple containers encapsulated in a single microservice. Each of these microservice designs are included in the term “microservice” as used herein. Additionally, each microservice may be encapsulated in a pod as illustrated in FIG. 22. However, microservices may also be represented by containers and collected together in a pod or container node to form the service 2216, according to different embodiments.

FIG. 23 illustrates a deployment environment for the service 2216, according to some embodiments. Some embodiments of the container platform 210 may utilize a container platform scheduler 2306 to scale and deploy a large number of containers across the container platform 210. When services are scaled out across multiple host systems within the container platform 210, the ability to manage each host system and abstract away the complexity of the underlying container platform may become advantageous. “Orchestration” is a broad term that refers to container scheduling, managing clusters of nodes, and the possible provisioning of additional hosts in the container environment. “Scheduling” in the container platform may refer to the ability to load a service onto a host system and establish how to run that specific container. While scheduling specifically refers to the process of loading the service definition, it can also be responsible for managing other aspects of the operation of the node duster. Managing a cluster of nodes involves a process of controlling the group of computing hosts. This is closely tied to scheduling because the scheduler 2306 may need to access each node in the cluster in order to schedule services as defined above. However, the process of the scheduler 2306 of most interest to the embodiments described herein involves host selection. In this sense, the scheduler 2202 may be tasked with automatically selecting a container environment on which to deploy and run a specific service built from a plurality of microservices.

In addition to the scheduler 2306, the container platform may include an API registry 404 as described in detail above. The API registry 404 may receive the service 2216 from the development environment of the IDE and work together with the scheduler 2306 to deploy the service 2216 into the container platform 210. As described above, the service 216 can be deployed based on one or more properties provided to the API registry 404, such as accessibility, security, encryption, authentication, and so forth. These properties can determine additional code that is added to either a calling service or to the service 2216 itself when the services are deployed to the container platform. However, the properties described above generally need not be concerned with the location of the service 2216 when it is deployed to the container platform.

Prior to this disclosure, a technical problem existed in the art based on deployment of a service. Because the service is meant to be a self-contained functional block, the inner workings of the service were deployed together in a pod/node into a single computing environment. For example, when the service 2216 is deployed to the container platform 210, all of the microservices 2202, 2204, 2206, 2208, 2210, 2212, and 2214 would be deployed together into a first environment 2302. Because it is assumed that the microservices 2202, 2204, 2206, 2208, 2210, 2212, and 2214 operate together as a logical and functional block, the service 2216 would always be deployed to a single environment without splitting up any of its internal components to other environments.

As used herein, the term “environment” may include a computing environment. Two different environments would be considered functionally and physically separate from each other. By way of example, the first environment may include a cloud platform that can receive containerized services and microservices. A second environment 2304 in the container platform may include an on-premise environment that is physically and functionally separate from the cloud platform of the first environment 2302. The first environment 2302 and the second environment 2304 may be physically separated by a distance of more than 1 mile, and may operate using separate and distinct processors, memory devices, network interfaces, other computer hardware. The first environment 2302 and the second environment 2304 may operate using different operating systems, or different instances of the same type of operating system. Additionally, the first environment 2302 and the second environment 2304 may have different characteristics that are considered by embodiments described below. Specifically, these environments 2302, 2304 may have different operating characteristics associated with cost, availability, security, latency, and other general characteristics associated with a computing environment.

Because the service 2216 would be deployed to a single environment, such as the first environment 2302, in systems operating prior to this disclosure, the deployment of the service 2216 could not be optimized to be deployed across separate physical environments, including the second environment 2304. Because these different environments 2302, 2304 have different characteristics, some of the microservices 2202, 2204, 2206, 2208, 2210, 2212, and 2214 may also have characteristics that make some of the microservices operate more efficiently on different environments. However, services were not split between separate environments. Furthermore, there was no method or process available for determining which microservices should be placed on which environments to optimize the overall performance of the service 2216. Finally, the service itself 2216 can be associated with a deployment criteria, such as cost, availability, latency, security, the location, and so forth, indicating a desired characteristic of the service 2216 to be optimized during deployment. However, container platforms were unable to recognize, process, and optimize service deployment based on deployment attributes. This led to inefficient service operation, including increased latency, inefficient memory usage, and increased processing requirements for the host environment, such as the first environment 2302.

The embodiments described herein solve these and other technical problems and improve the overall functioning of the cloud platform 210 by receiving one or more criteria associated with the deployment of the service 2216 that specifies an operational aspect of the service 2216 that should be optimized by the deployment. The scheduler 2306 and/or the API registry 404 can access attributes associated with each of a plurality of environments and determine how best to deploy the microservices that make up the service 2216 across any of the available environments to best meet the deployment criteria of the service 2216. This not only optimizes the operation of the service 2216 itself, but also optimizes the use of the container platform resources. For example, a deployment criteria to optimize latency reduces the number of inter-environmental calls between environments and minimizes the latency of the service itself. A deployment criteria to optimize security places microservices with stringent security requirements on the most secure environments, but allows microservices with less stringent security requirements to be placed on more efficient servers, thus reducing the processing and resource overhead associated with security functions for microservices that do not require such.

FIG. 24 illustrates a flowchart of a method for optimizing service deployment across a plurality of computing environments, according to some embodiments. The method may include receiving a service that includes a plurality of containerized microservices (2401). The service in microservices may be arranged as described above in FIGS. 22-23. The services may be received by a scheduler, an orchestration platform, and/or API registry for deployment. The method may also include receiving a deployment criteria for deploying the service in a plurality of environments (2403). As described above, the plurality of environments may be physically separate environments and have different operating characteristics. Thus, the method may also include accessing characteristics of the plurality of environments (2405). Based on the characteristics of the plurality of environments and the deployment criteria for the service, the method may also include deploying the plurality of micro-services across the plurality of environments (2407). Examples of how the plurality of micro-services can be deployed across the plurality of environments are described in greater detail below.

FIG. 25 illustrates a diagram of service criteria and environment/microservice attributes, according to some embodiments. The service 2216 may include a plurality of micro-services as described above. Additionally, the service 2216 may be associated with one or more deployment criteria 2502. In some embodiments, the deployment criteria 2502 may include a list of options comprising different aspects of operating the service 2216 can be optimized through various deployment strategies in the container platform. For example, in FIG. 25, the criteria 2502 include a list of checkboxes for options that can be selected by a developer before the service 2216 is deployed. The scheduler and/or API registry can then access predetermined code sequences stored therein that can execute the deployment of the service 2216 according to the selected criteria 2502. In some embodiments, multiple selections in the criteria 2502 can be selected, and the overall optimization process can optimize the deployment to balance the one or more criteria 2502 that have been selected.

In addition to an overall criteria 2502 for the service 2216, each of the microservices in the service 2216 may have one or more attributes 2510 that provide a value associated with each of the criteria 2502 specified for the service 2216. The value associated with each of these attributes 2510 may characterize how the associated microservice should be evaluated according to the criteria 2502. For example, for criteria specifying that the deployment of the service 2216 should be optimized for security concerns, the security attribute of the microservice may include a value that characterizes a relative security requirement for that microservice in comparison to other microservices in the service 2216. In another example, for criteria specifying that the deployment of the service 2216 should be optimized for latency, the latency attribute of a microservice may characterize an execution latency for the associated microservice in relation to other microservices in the service 2216. In another example, for criteria specifying that the deployment of the service 2216 should be optimized for co-location on the same computing environment, the value for the co-location attribute of a microservice may characterize inter-service connections with other microservices, such as a number or an identification of other services that call or are called by this microservice. In another example, for criteria specifying that the deployment of the service 2216 should be optimized for availability, the value of the availability attribute of the microservice may characterize how exposed the microservice is to frontend requests versus backend processing. In addition to these examples, any other criteria that specifies how the deployment of the service can be optimized may correspond to attributes for each microservice in the service with associated values that characterize the way in which that microservice can be placed in various environments to optimize deployment according to the criteria.

As described above, the container platform may include a plurality of environments, such as a first environment 2302 and a second environment 2304. Note that the use of these two environments is provided merely by way of example and is not meant to be limiting. Other embodiments may include any number of computing environments of any type with hardware platforms that extend beyond cloud deployments and on-premise deployments. Regardless of the number of environments, each environment may include a set of attributes 2504, 2506 with accompanying values that characterize how that environment relates to the corresponding criteria for optimizing deployment. For example, for criteria that minimizes cost, the corresponding cost attribute of the first environment 2302 can characterize a cost of deploying microservices to the first environment 2302 relative to deploying microservices to other environments. In another example, for criteria that specify maximizing availability for the service, the corresponding availability attribute of the first environment 2302 can characterize the availability of the first environment 2302 relative to other environments in the container platform. Other similar examples may include attributes that characterize the latency of micro-services operating on the environment, a security level of the environment, and any other operating characteristic of a computing environment.

The API registry 404 and/or the scheduler 2306 may receive the criteria 2502 for deploying the service 2216, the attributes 2510 for each of the microservices in the service 2216, and/or the attributes for each of the operating environments 2302, 2304, etc., in the container platform. The scheduler 2304 and/or API registry 404 can then match microservices to computing environments by matching attributes that are specified by the criteria 2502. For example, for criteria specifying security optimization, microservices with high security requirement values can be matched with environments with high security rating values, while microservices with low security requirement values can be matched with computing environments with lower security rating values. When multiple criteria 2502 are specified for optimization during deployment, the API registry 404 and/or container platform 2306 can rank each of the criteria 2502 and use a weighted combination of values to minimize a cost equation with variables representing locations for each microservice to be deployed. A cost equation can be minimized to yield an optimized deployment map for each microservice in the service 2216 that specifies a selected computing environment for each microservice. Specific examples of how this deployment optimization can be calculated in practice are described in detail below.

FIG. 26 illustrates an example of a service deployment that is optimized to minimize operating costs, according to some embodiments. When the scheduler 2306 and/or the API registry deploys the service 2216, the cost criteria can be selected, and the attributes of each of the microservices can be analyzed. Next, the attributes of each of the computing environments 2302, 2304 can also be analyzed and the deployment of the microservices across the computing environments 2302, 2304 can be optimized accordingly. For example, when analyzing the attributes of the microservices, it may be determined that the value for the attributes of microservice 2214 and microservice 2212 may specify a low cost value. This may indicate that these microservices 2214, 2212 may have a small memory and/or processing footprint that may cost less to deploy to the cloud platform of the first environment 2302. Note that other examples may use other metrics to determine cost, such as number of requests received, bandwidth requirements, and so forth. Because these services may cost relatively little to deploy to the cloud environment, the API registry 404 and/or scheduler 2306 can select the first environment 2302 for microservices 2214, 2212.

In contrast, the values for the cost attributes of microservices 2210, 2206, 2202, 2208, and 2204 may indicate a relatively high cost value for deployment on a cloud platform. Therefore, the API registry 404 and/or scheduler 2306 can match these high cost values with the attributes of the second environment 2304 corresponding to an on-premise computing platform. The second environment 2304 may have a high value for its cost attribute indicating that high-cost microservices should be deployed thereon.

After matching microservices to computing environments based on attribute values that correspond to the specified deployment criteria, the API registry 404 and/or scheduler 2306 can deploy the individual microservices to the selected computing environments and provide an endpoint 2218 for accessing the overall service. As described above, the API registry 404 can provide a stable endpoint for the overall service by binding the endpoint to a set of one or more API functions defined by an API definition file uploaded to the API registry 404. In embodiments without the API registry 404, the scheduler 2306 can deploy the microservices in a similar fashion and provide the endpoint 2218 to be available to HTTP service calls.

FIG. 27 illustrates an example of a service deployment that is optimized for service availability, according to some embodiments. As described in the example above for FIG. 26, the API registry 404 and/or the scheduler 2306 can receive one or more selected criteria for deployment of the service specifying that the deployment should be optimized for maximum availability of the service. The API registry 404 and/or the scheduler 2306 can then match values for attributes in the computing environments 2302, 2304 and the microservices for the availability attribute in each. In this example, microservice 2214 may implement a front end interface with a relatively high availability value. This can be matched to the on-premise system of the second computing environment 2304 which may also have a high availability value indicating that the second environment 2304 is highly available.

In contrast, the other microservices 2202, 2204, 2206, 2208, 2210, and 2212 may have relatively low attribute values in comparison to microservice 2214 indicating that high availability for these microservices is not essential for the overall service to be highly available. This attribute value can be matched to the second computing environment 2302 for the cloud platform that may not have as high an availability rating as the second environment 2304. The API registry 404 and/or scheduler 2306 can then deploy each of microservices to their corresponding selected computing environments and provide an endpoint 2218 for the overall service.

FIG. 28 illustrates an example of a service deployment that is optimized for security concerns, according to some embodiments. In this example, the API registry 404 and/or scheduler 2306 may receive the microservices defining the overall service for deployment, along with a criteria that specifies that the deployment should be optimized to match the security of the computing environments to the security requirements of each microservice. To implement this deployment optimization, the API registry. 404 and/or scheduler 2306 may match the values of the security attributes of each microservice to the corresponding security value rating of each of the computing environments 2302, 2304. In this example, the first computing environment 2302 for the cloud platform may have a relatively high value rating for the security attribute indicating that the cloud platform can provide better security than other computing environments. Similarly, microservices 2202 and 2204 may also have relatively high value requirements for the security attribute indicating that they process sensitive data requiring a high level of security protection. The API registry 404 and/or scheduler 2306 can match these relatively high values for the security attributes and select the first computing environment 2302 for microservices 2202, 2204.

Similarly, the second computing environment 2304 for the on-premise platform may have a security rating that is lower than that of the first computing environment 2302. Thus, microservices 2206, 2208, 2210, 2212, and 2214 can be matched to the second computing environment 2304 if they have value requirements that are relatively lower than the requirements of microservices 2202, 2204. After making these determinations, the API registry 404 and/or scheduler 2306 can deploy each of the microservices to their selected computing environments and provide a stable endpoint 2218 for the overall service.

FIG. 29 illustrates an example of a service deployment that is optimized for processing speed, according to some embodiments. A speed criteria may be similar to an latency criteria for the service. The API registry 404 and/or scheduler 2306 can receive the microservices defining the overall service along with a criteria specifying that the deployment should be optimized for processing speed. The speed attribute for the first computing environment 2302 may indicate that the first computing environment 2302 has less computing resources available than other environments in the container platform. Thus, the first computing environment 2302 can be matched to microservices 2202, 2204, and 2208 that have relatively low values for their speed attributes, indicating that they do not require a relatively high amount of processing power to operate efficiently.

Additionally, the API registry 404 and/or scheduler 2306 can determine that the speed attribute for the second computing environment 2304 is relatively high, indicating that the second computing environment 2304 has more processing resources available than other environments in the container platform. Thus, the second computing environment 2304 can be matched to microservices 2206, 2210, 2212, and 2214 with speed attribute values indicating they require a relatively high amount of processing resources. Each of the microservices can then be deployed by the API registry 404 and/or scheduler 2306, and the endpoint 2218 can be provided for the overall service.

FIG. 30 illustrates an example of a service deployment that is optimized according to multiple criteria simultaneously, according to some embodiments. In this example, the criteria accompanying the service may indicate that the deployment should be optimized to minimize calls made between different computing environments and to maximize processing speed. Normally, minimizing microservice calls between computing environments would be best accomplished by placing all microservices on a single computing environment. However, by specifying additional criteria, the deployment may also match the processing requirements of microservices to processing capabilities of computing environments.

For example, by matching attribute values as described above, the API registry 404 and/or scheduler 2306 can match the processing requirements of microservices 2202, 2204, and 2208 to the processing capabilities of the first computing environment 2302. Similarly, the API registry 404 and/or scheduler 2306 can match the processing requirements of microservices 2206, 2210, 2212, and 2214 to the processing capabilities of the second computing environment 2304. However, instead of deploying these microservices to the selected computing environments, further optimizations can be made to co-locate microservices that have a pattern of service calls that can be grouped together. For example, microservices 2202, 2204, 2206, and 2208 all exchange multiple inter-service calls between each other. In order to minimize the number of cross-environment service calls, microservice 2206 can be moved from the second computing environment 2304 to the first computing environment 2302. This type of deployment balances both of the deployment criteria of maximizing computing speed while minimizing cross-environment service calls. This example may also give slight preference to minimizing cross-environment service calls by applying a higher weight to this attribute in a cost minimization function referenced above.

In the examples described above, the optimization of microservice deployment has been executed during an initial deployment of the service to the plurality of computing environments. However, the same process for optimizing an initial deployment can be used to optimize a subsequent deployment. For example, after an initial deployment, the criteria of a service may be changed by a developer, administrator, or by an automatic process operating on the container platform. When this criteria changes, the same process described above can be executed again to match values for environment and microservice attributes for the new criteria selections. The microservices for the service can then be re-deployed according to the new criteria.

Similarly, values for attributes of computing environments may change over time. For example, the first environment 2302 may receive a hardware upgrade that increases the amount of processing resources available. This may lead to a corresponding increase in attribute characterizing an amount of computing resources available in the first environment 2302. The API registry 404 and/or scheduler 2306 can then determine whether the changed attribute corresponds to one of the criteria for the service deployment. If the attribute does correspond to a deployment criteria, the service can be redeployed to optimize based on the changed attribute value for the computing environment.

In addition to changes of attribute values for the computing environments, attributes of microservices in the service may also change over time. One of the advantages of using containerized microservices is that these microservices can be independently upgraded, replaced, and redeployed over time. If one or more microservices are upgraded for the service, they may have new values assigned to their corresponding attributes if the performance or requirements of the microservice are changed following the upgrade/redeployment. As described above, the API registry 404 and/or scheduler 2306 can determine whether the attribute with the changed value corresponds to a deployment criteria for the overall service. If so, the API registry 404 and/or scheduler 2306 can redeploy at least the affected microservice to optimize the deployment according to the changed attribute value.

In the examples described above, the attribute values for both computing environments and microservices are compared to each other in a relative fashion. For example, if the first environment 2302 has a higher security rating value than the second environment 2304, then microservices having a security requirement that is relatively higher than other microservices can be placed in the first environment 2302. The security requirement represented by the value of the security attribute can be compared relatively between microservices in the service, such that microservices with higher requirements can be deployed to the first environment. This relative comparison can be made by using absolute thresholds. For example, predetermined thresholds can be used to separate values for any attributes into various groups. In some embodiments, the microservices can be clustered into a number of groups corresponding to the number of computing environments. Using FIG. 30 as an example, microservices 2202, 2204, 2206, 2208 can be clustered together by attribute value to form a first group while microservices 2210, 2212, and 2214 can be clustered together by attribute value to form a second group. Clustering algorithms can find natural separations in the attribute values to form these groups, and the number of groups can be less than or equal to the number available computing environments. Each of these cluster groups can then be matched to a computing environment ranked by attribute values.

Each of the methods described herein may be implemented by a specialized computer system. Each step of these methods may be executed automatically by the computer system, and/or may be provided with inputs/outputs involving a user. For example, a user may provide inputs for each step in a method, and each of these inputs may be in response to a specific output requesting such an input, wherein the output is generated by the computer system. Each input may be received in response to a corresponding requesting output. Furthermore, inputs may be received from a user, from another computer system as a data stream, retrieved from a memory location, retrieved over a network, requested from a web service, and/or the like. Likewise, outputs may be provided to a user, to another computer system as a data stream, saved in a memory location, sent over a network, provided to a web service, and/or the like. In short, each step of the methods described herein may be performed by a computer system, and may involve any number of inputs, outputs, and/or requests to and from the computer system which may or may not involve a user. Those steps not involving a user may be said to be performed automatically by the computer system without human intervention. Therefore, it will be understood in light of this disclosure, that each step of each method described herein may be altered to include an input and output to and from a user, or may be done automatically by a computer system without human intervention where any determinations are made by a processor. Furthermore, some embodiments of each of the methods described herein may be implemented as a set of instructions stored on a tangible, non-transitory storage medium to form a tangible software product.

FIG. 31 depicts a simplified diagram of a distributed system 3100 that may interact with any of the embodiments described above. In the illustrated embodiment, distributed system 3100 includes one or more client computing devices 3102, 3104, 3106, and 3108, which are configured to execute and operate a client application such as a web browser, proprietary client (e.g., Oracle Forms), or the like over one or more network(s) 3110. Server 3112 may be communicatively coupled with remote client computing devices 3102, 3104, 3106, and 3108 via network 3110.

In various embodiments, server 3112 may be adapted to run one or more services or software applications provided by one or more of the components of the system. In some embodiments, these services may be offered as web-based or cloud services or under a Software as a Service (SaaS) model to the users of client computing devices 3102, 3104, 3106, and/or 3108. Users operating client computing devices 3102, 3104, 3106, and/or 3108 may in turn utilize one or more client applications to interact with server 3112 to utilize the services provided by these components.

In the configuration depicted in the figure, the software components 3118, 3120 and 3122 of system 3100 are shown as being implemented on server 3112. In other embodiments, one or more of the components of system 3100 and/or the services provided by these components may also be implemented by one or more of the client computing devices 3102, 3104, 3106, and/or 3108. Users operating the client computing devices may then utilize one or more client applications to use the services provided by these components. These components may be implemented in hardware, firmware, software, or combinations thereof. It should be appreciated that various different system configurations are possible, which may be different from distributed system 3100. The embodiment shown in the figure is thus one example of a distributed system for implementing an embodiment system and is not intended to be limiting.

Client computing devices 3102, 3104, 3106, and/or 3108 may be portable handheld devices (e.g., an iPhone®, cellular telephone, an iPad®, computing tablet, a personal digital assistant (PDA)) or wearable devices (e.g., a Google Glass® head mounted display), running software such as Microsoft Windows Mobile®, and/or a variety of mobile operating systems such as iOS, Windows Phone, Android, BlackBerry 10, Palm OS, and the like, and being Internet, e-mail, short message service (SMS), Blackberry®, or other communication protocol enabled. The client computing devices can be general purpose personal computers including, by way of example, personal computers and/or laptop computers running various versions of Microsoft Windows®, Apple Macintosh®, and/or Linux operating systems. The client computing devices can be workstation computers limning any of a variety of commercially-available UNIX® or UNIX-like operating systems, including without limitation the variety of GNU/Linux operating systems, such as for example, Google Chrome OS. Alternatively, or in addition, client computing devices 3102, 3104, 3106, and 3108 may be any other electronic device, such as a thin-client computer, an Internet-enabled gaming system (e.g., a Microsoft Xbox gaming console with or without a Kinect® gesture input device), and/or a personal messaging device, capable of communicating over network(s) 3110.

Although exemplary distributed system 3100 is shown with four client computing devices, any number of client computing devices may be supported. Other devices, such as devices with sensors, etc., may interact with server 3112.

Network(s) 3110 in distributed system 3100 may be any type of network familiar to those skilled in the art that can support data communications using any of a variety of commercially-available protocols, including without limitation TCP/IP (transmission control protocol/Internet protocol), SNA (systems network architecture), IPX (Internet packet exchange), AppleTalk, and the like. Merely by way of example, network(s) 3110 can be a local area network (LAN), such as one based on Ethernet, Token-Ring and/or the like. Network(s) 3110 can be a wide-area network and the Internet. It can include a virtual network, including without limitation a virtual private network (VPN), an intranet, an extranet, a public switched telephone network (PSTN), an infra-red network, a wireless network (e.g., a network operating under any of the Institute of Electrical and Electronics (IEEE) 802.11 suite of protocols, Bluetooth®, and/or any other wireless protocol); and/or any combination of these and/or other networks.

Server 3112 may be composed of one or more general purpose computers, specialized server computers (including, by way of example, PC (personal computer) servers, UNIX® servers, mid-range servers, mainframe computers, rack-mounted servers, etc.), server farms, server clusters, or any other appropriate arrangement and/or combination. In various embodiment, server 3112 may be adapted to run one or more services or software applications described in the foregoing disclosure. For example, server 3112 may correspond to a server for performing processing described above according to an embodiment of the present disclosure.

Server 3112 may run an operating system including any of those discussed above, as well as any commercially available server operating system. Server 3112 may also run any of a variety of additional server applications and/or mid-tier applications, including HTTP (hypertext transport protocol) servers, FTP (file transfer protocol) servers, CGI (common gateway interface) servers, JAVA® servers, database servers, and the like. Exemplary database servers include without limitation those commercially available from Oracle, Microsoft, Sybase, IBM (International Business Machines), and the like.

In some implementations, server 3112 may include one or more applications to analyze and consolidate data feeds and/or event updates received from users of client computing devices 3102, 3104, 3106, and 3108. As an example, data feeds and/or event updates may include, but are not limited to, Twitter® feeds, Facebook® updates or real-time updates received from one or more third party information sources and continuous data streams, which may include real-time events related to sensor data applications, financial tickers, network performance measuring tools (e.g., network monitoring and traffic management applications), clickstream analysis tools, automobile traffic monitoring, and the like. Server 3112 may also include one or more applications to display the data feeds and/or real-time events via one or more display devices of client computing devices 3102, 3104, 3106, and 3108.

Distributed system 3100 may also include one or more databases 3114 and 3116. Databases 3114 and 3116 may reside in a variety of locations. By way of example, one or more of databases 3114 and 3116 may reside on a non-transitory storage medium local to (and/or resident in) server 3112. Alternatively, databases 3114 and 3116 may be remote from server 3112 and in communication with server 3112 via a network-based or dedicated connection. In one set of embodiments, databases 3114 and 3116 may reside in a storage-area network (SAN). Similarly, any necessary files for performing the functions attributed to server 3112 may be stored locally on server 3112 and/or remotely, as appropriate. In one set of embodiments, databases 3114 and 3116 may include relational databases, such as databases provided by Oracle, that are adapted to store, update, and retrieve data in response to SQL-formatted commands.

FIG. 32 is a simplified block diagram of one or more components of a system environment 3200 by which services provided by one or more components of an embodiment system may be offered as cloud services, in accordance with an embodiment of the present disclosure. In the illustrated embodiment, system environment 3200 includes one or more client computing devices 3204, 3206, and 3208 that may be used by users to interact with a cloud infrastructure system 3202 that provides cloud services. The client computing devices may be configured to operate a client application such as a web browser, a proprietary client application (e.g., Oracle Forms), or some other application, which may be used by a user of the client computing device to interact with cloud infrastructure system 3202 to use services provided by cloud infrastructure system 3202.

It should be appreciated that cloud infrastructure system 3202 depicted in the figure may have other components than those depicted. Further, the embodiment shown in the figure is only one example of a cloud infrastructure system that may incorporate an embodiment of the invention. In some other embodiments, cloud infrastructure system 3202 may have more or fewer components than shown in the figure, may combine two or more components, or may have a different configuration or arrangement of components.

Client computing devices 3204, 3206, and 3208 may be devices similar to those described above for 3102, 3104, 3106, and 3108.

Although exemplary system environment 3200 is shown with three client computing devices, any number of client computing devices may be supported. Other devices such as devices with sensors, etc. may interact with cloud infrastructure system 3202.

Network(s) 3210 may facilitate communications and exchange of data between clients 3204, 3206, and 3208 and cloud infrastructure system 3202. Each network may be any type of network familiar to those skilled in the art that can support data communications using any of a variety of commercially-available protocols, including those described above for network(s) 3110.

Cloud infrastructure system 3202 may comprise one or more computers and/or servers that may include those described above for server 3112.

In certain embodiments, services provided by the cloud infrastructure system may include a host of services that are made available to users of the cloud infrastructure system on demand, such as online data storage and backup solutions, Web-based e-mail services, hosted office suites and document collaboration services, database processing, managed technical support services, and the like. Services provided by the cloud infrastructure system can dynamically scale to meet the needs of its users. A specific instantiation of a service provided by cloud infrastructure system is referred to herein as a “service instance.” In general, any service made available to a user via a communication network, such as the Internet, from a cloud service provider's system is referred to as a “cloud service.” Typically, in a public cloud environment, servers and systems that make up the cloud service provider's system are different from the customer's own on-premises servers and systems. For example, a cloud service provider's system may host an application, and a user may, via a communication network such as the Internet, on demand, order and use the application.

In some examples, a service in a computer network cloud infrastructure may include protected computer network access to storage, a hosted database, a hosted web server, a software application, or other service provided by a cloud vendor to a user, or as otherwise known in the art. For example, a service can include password-protected access to remote storage on the cloud through the Internet. As another example, a service can include a web service-based hosted relational database and a script-language middleware engine for private use by a networked developer. As another example, a service can include access to an email software application hosted on a cloud vendor's web site.

In certain embodiments, cloud infrastructure system 3202 may include a suite of applications, middleware, and database service offerings that are delivered to a customer in a self-service, subscription-based, elastically scalable, reliable, highly available, and secure manner. An example of such a cloud infrastructure system is the Oracle Public Cloud provided by the present assignee.

In various embodiments, cloud infrastructure system 3202 may be adapted to automatically provision, manage and track a customer's subscription to services offered by cloud infrastructure system 3202. Cloud infrastructure system 3202 may provide the cloud services via different deployment models. For example, services may be provided under a public cloud model in which cloud infrastructure system 3202 is owned by an organization selling cloud services (e.g., owned by Oracle) and the services are made available to the general public or different industry enterprises. As another example, services may be provided under a private cloud model in which cloud infrastructure system 3202 is operated solely for a single organization and may provide services for one or more entities within the organization. The cloud services may also be provided under a community cloud model in which cloud infrastructure system 3202 and the services provided by cloud infrastructure system 3202 are shared by several organizations in a related community. The cloud services may also be provided under a hybrid cloud model, which is a combination of two or more different models.

In some embodiments, the services provided by cloud infrastructure system 3202 may include one or more services provided under Software as a Service (SaaS) category, Platform as a Service (PaaS) category, Infrastructure as a Service (IaaS) category, or other categories of services including hybrid services. A customer, via a subscription order, may order one or more services provided by cloud infrastructure system 3202. Cloud infrastructure system 3202 then performs processing to provide the services in the customer's subscription order.

In some embodiments, the services provided by cloud infrastructure system 3202 may include, without limitation, application services, platform services and infrastructure services. In some examples, application services may be provided by the cloud infrastructure system via a SaaS platform. The SaaS platform may be configured to provide cloud services that fall under the SaaS category. For example, the SaaS platform may provide capabilities to build and deliver a suite of on-demand applications on an integrated development and deployment platform. The SaaS platform may manage and control the underlying software and infrastructure for providing the SaaS services. By utilizing the services provided by the SaaS platform, customers can utilize applications executing on the cloud infrastructure system. Customers can acquire the application services without the need for customers to purchase separate licenses and support. Various different SaaS services may be provided. Examples include, without limitation, services that provide solutions for sales performance management, enterprise integration, and business flexibility for large organizations.

In some embodiments, platform services may be provided by the cloud infrastructure system via a PaaS platform. The PaaS platform may be configured to provide cloud services that fall under the PaaS category. Examples of platform services may include without limitation services that enable organizations (such as Oracle) to consolidate existing applications on a shared, common architecture, as well as the ability to build new applications that leverage the shared services provided by the platform. The PaaS platform may manage and control the underlying software and infrastructure for providing the PaaS services. Customers can acquire the PaaS services provided by the cloud infrastructure system without the need for customers to purchase separate licenses and support. Examples of platform services include, without limitation, Oracle Java Cloud Service (JCS), Oracle Database Cloud Service (DBCS), and others.

By utilizing the services provided by the PaaS platform, customers can employ programming languages and tools supported by the cloud infrastructure system and also control the deployed services. In some embodiments, platform services provided by the cloud infrastructure system may include database cloud services, middleware cloud services (e.g., Oracle Fusion Middleware services), and Java cloud services. In one embodiment, database cloud services may support shared service deployment models that enable organizations to pool database resources and offer customers a Database as a Service in the form of a database cloud. Middleware cloud services may provide a platform for customers to develop and deploy various business applications, and Java cloud services may provide a platform for customers to deploy Java applications, in the cloud infrastructure system.

Various different infrastructure services may be provided by an IaaS platform in the cloud infrastructure system. The infrastructure services facilitate the management and control of the underlying computing resources, such as storage, networks, and other fundamental computing resources for customers utilizing services provided by the SaaS platform and the PaaS platform.

In certain embodiments, cloud infrastructure system 3202 may also include infrastructure resources 3230 for providing the resources used to provide various services to customers of the cloud infrastructure system. In one embodiment, infrastructure resources 3230 may include pre-integrated and optimized combinations of hardware, such as servers, storage, and networking resources to execute the services provided by the PaaS platform and the SaaS platform.

In some embodiments, resources in cloud infrastructure system 3202 may be shared by multiple users and dynamically re-allocated per demand. Additionally, resources may be allocated to users in different time zones. For example, cloud infrastructure system 3230 may enable a first set of users in a first time zone to utilize resources of the cloud infrastructure system for a specified number of hours and then enable the re-allocation of the same resources to another set of users located in a different time zone, thereby maximizing the utilization of resources.

In certain embodiments, a number of internal shared services 3232 may be provided that are shared by different components or modules of cloud infrastructure system 3202 and by the services provided by cloud infrastructure system 3202. These internal shared services may include, without limitation, a security and identity service, an integration service, an enterprise repository service, an enterprise manager service, a virus scanning and white list service, a high availability, backup and recovery service, service for enabling cloud support, an email service, a notification service, a file transfer service, and the like.

In certain embodiments, cloud infrastructure system 3202 may provide comprehensive management of cloud services (e.g., SaaS, PaaS, and IaaS services) in the cloud infrastructure system. In one embodiment, cloud management functionality may include capabilities for provisioning, managing and tracking a customer's subscription received by cloud infrastructure system 3202, and the like.

In one embodiment, as depicted in the figure, cloud management functionality may be provided by one or more modules, such as an order management module 3220, an order orchestration module 3222, an order provisioning module 3224, an order management and monitoring module 3226, and an identity management module 3228. These modules may include or be provided using one or more computers and/or servers, which may be general purpose computers, specialized server computers, server farms, server clusters, or any other appropriate arrangement and/or combination.

In exemplary operation 3234, a customer using a client device, such as client device 3204, 3206 or 3208, may interact with cloud infrastructure system 3202 by requesting one or more services provided by cloud infrastructure system 3202 and placing an order for a subscription for one or more services offered by cloud infrastructure system 3202. In certain embodiments, the customer may access a cloud User Interface (UI), cloud UI 3212, cloud UI 3214 and/or cloud UI 3216 and place a subscription order via these UIs. The order information received by cloud infrastructure system 3202 in response to the customer placing an order may include information identifying the customer and one or more services offered by the cloud infrastructure system 3202 that the customer intends to subscribe to.

After an order has been placed by the customer, the order information is received via the cloud UIs, 3212, 3214 and/or 3216.

At operation 3236, the order is stored in order database 3218. Order database 3218 can be one of several databases operated by cloud infrastructure system 3218 and operated in conjunction with other system elements.

At operation 3238, the order information is forwarded to an order management module 3220. In some instances, order management module 3220 may be configured to perform billing and accounting functions related to the order, such as verifying the order, and upon verification, booking the order.

At operation 3240, information regarding the order is communicated to an order orchestration module 3222. Order orchestration module 3222 may utilize the order information to orchestrate the provisioning of services and resources for the order placed by the customer. In some instances, order orchestration module 3222 may orchestrate the provisioning of resources to support the subscribed services using the services of order provisioning module 3224.

In certain embodiments, order orchestration module 3222 enables the management of business processes associated with each order and applies business logic to determine whether an order should proceed to provisioning. At operation 3242, upon receiving an order for a new subscription, order orchestration module 3222 sends a request to order provisioning module 3224 to allocate resources and configure those resources needed to fulfill the subscription order. Order provisioning module 3224 enables the allocation of resources for the services ordered by the customer. Order provisioning module 3224 provides a level of abstraction between the cloud services provided by cloud infrastructure system 3200 and the physical implementation layer that is used to provision the resources for providing the requested services. Order orchestration module 3222 may thus be isolated from implementation details, such as whether or not services and resources are actually provisioned on the fly or pre-provisioned and only allocated/assigned upon request.

At operation 3244, once the services and resources are provisioned, a notification of the provided service may be sent to customers on client devices 3204, 3206 and/or 3208 by order provisioning module 3224 of cloud infrastructure system 3202.

At operation 3246, the customer's subscription order may be managed and tracked by an order management and monitoring module 3226. In some instances, order management and monitoring module 3226 may be configured to collect usage statistics for the services in the subscription order, such as the amount of storage used, the amount data transferred, the number of users, and the amount of system up time and system down time.

In certain embodiments, cloud infrastructure system 3200 may include an identity management module 3228. Identity management module 3228 may be configured to provide identity services, such as access management and authorization services in cloud infrastructure system 3200. In some embodiments, identity management module 3228 may control information about customers who wish to utilize the services provided by cloud infrastructure system 3202. Such information can include information that authenticates the identities of such customers and information that describes which actions those customers are authorized to perform relative to various system resources (e.g., files, directories, applications, communication ports, memory segments, etc.) Identity management module 3228 may also include the management of descriptive information about each customer and about how and by whom that descriptive information can be accessed and modified.

FIG. 33 illustrates an exemplary computer system 3300, in which various embodiments of the present invention may be implemented. The system 3300 may be used to implement any of the computer systems described above. As shown in the figure, computer system 3300 includes a processing unit 3304 that communicates with a number of peripheral subsystems via a bus subsystem 3302. These peripheral subsystems may include a processing acceleration unit 3306, an I/O subsystem 3308, a storage subsystem 3318 and a communications subsystem 3324. Storage subsystem 3318 includes tangible computer-readable storage media 3322 and a system memory 3310.

Bus subsystem 3302 provides a mechanism for letting the various components and subsystems of computer system 3300 communicate with each other as intended. Although bus subsystem 3302 is shown schematically as a single bus, alternative embodiments of the bus subsystem may utilize multiple buses. Bus subsystem 3302 may be any of several types of bus structures including a memory bus or memory controller, a peripheral bus, and a local bus using any of a variety of bus architectures. For example, such architectures may include an Industry Standard Architecture (ISA) bus, Micro Channel Architecture (MCA) bus, Enhanced ISA (EISA) bus, Video Electronics Standards Association (VESA) local bus, and Peripheral Component Interconnect (PCI) bus, which can be implemented as a Mezzanine bus manufactured to the IEEE P1386.1 standard.

Processing unit 3304, which can be implemented as one or more integrated circuits (e.g., a conventional microprocessor or microcontroller), controls the operation of computer system 3300. One or more processors may be included in processing unit 3304. These processors may include single core or multicore processors. In certain embodiments, processing unit 3304 may be implemented as one or more independent processing units 3332 and/or 3334 with single or multicore processors included in each processing unit. In other embodiments, processing unit 3304 may also be implemented as a quad-core processing unit formed by integrating two dual-core processors into a single chip.

In various embodiments, processing unit 3304 can execute a variety of programs in response to program code and can maintain multiple concurrently executing programs or processes. At any given time, some or all of the program code to be executed can be resident in processor(s) 3304 and/or in storage subsystem 3318. Through suitable programming, processor(s) 3304 can provide various functionalities described above. Computer system 3300 may additionally include a processing acceleration unit 3306, which can include a digital signal processor (DSP), a special-purpose processor, and/or the like.

I/O subsystem 3308 may include user interface input devices and user interface output devices. User interface input devices may include a keyboard, pointing devices such as a mouse or trackball, a touchpad or touch screen incorporated into a display, a scroll wheel, a click wheel, a dial, a button, a switch, a keypad, audio input devices with voice command recognition systems, microphones, and other types of input devices. User interface input devices may include, for example, motion sensing and/or gesture recognition devices such as the Microsoft Kinect® motion sensor that enables users to control and interact with an input device, such as the Microsoft Xbox® 360 game controller, through a natural user interface using gestures and spoken commands. User interface input devices may also include eye gesture recognition devices such as the Google Glass® blink detector that detects eye activity (e.g., ‘blinking’ while taking pictures and/or making a menu selection) from users and transforms the eye gestures as input into an input device (e.g., Google Glass®). Additionally, user interface input devices may include voice recognition sensing devices that enable users to interact with voice recognition systems (e.g., Ski® navigator), through voice commands.

User interface input devices may also include, without limitation, three dimensional (3D) mice, joysticks or pointing sticks, gamepads and graphic tablets, and audio/visual devices such as speakers, digital cameras, digital camcorders, portable media players, webcams, image scanners, fingerprint scanners, barcode reader 3D scanners, 3D printers, laser rangefinders, and eye gaze tracking devices. Additionally, user interface input devices may include, for example, medical imaging input devices such as computed tomography, magnetic resonance imaging, position emission tomography, medical ultrasonography devices. User interface input devices may also include, for example, audio input devices such as MIDI keyboards, digital musical instruments and the like.

User interface output devices may include a display subsystem, indicator lights, or non-visual displays such as audio output devices, etc. The display subsystem may be a cathode ray tube (CRT), a flat-panel device, such as that using a liquid crystal display (LCD) or plasma display, a projection device, a touch screen, and the like. In general, use of the term “output device” is intended to include all possible types of devices and mechanisms for outputting information from computer system 3300 to a user or other computer. For example, user interface output devices may include, without limitation, a variety of display devices that visually convey text, graphics and audio/video information such as monitors, printers, speakers, headphones, automotive navigation systems, plotters, voice output devices, and modems.

Computer system 3300 may comprise a storage subsystem 3318 that comprises software elements, shown as being currently located within a system memory 3310. System memory 3310 may store program instructions that are loadable and executable on processing unit 3304, as well as data generated during the execution of these programs.

Depending on the configuration and type of computer system 3300, system memory 3310 may be volatile (such as random access memory (RAM)) and/or non-volatile (such as read-only memory (ROM), flash memory, etc.) The RAM typically contains data and/or program modules that are immediately accessible to and/or presently being operated and executed by processing unit 3304. In some implementations, system memory 3310 may include multiple different types of memory, such as static random access memory (SRAM) or dynamic random access memory (DRAM). In some implementations, a basic input/output system (BIOS), containing the basic routines that help to transfer information between elements within computer system 3300, such as during start-up, may typically be stored in the ROM. By way of example, and not limitation, system memory 3310 also illustrates application programs 3312, which may include client applications, Web browsers, mid-tier applications, relational database management systems (RDBMS), etc., program data 3314, and an operating system 3316. By way of example, operating system 3316 may include various versions of Microsoft Windows®, Apple Macintosh®, and/or Linux operating systems, a variety of commercially-available UNIX® or UNIX-like operating systems (including without limitation the variety of GNU/Linux operating systems, the Google Chrome® OS, and the like) and/or mobile operating systems such as iOS, Windows® Phone, Android® OS, BlackBerry® 10 OS, and Palm® OS operating systems.

Storage subsystem 3318 may also provide a tangible computer-readable storage medium for storing the basic programming and data constructs that provide the functionality of some embodiments. Software (programs, code modules, instructions) that when executed by a processor provide the functionality described above may be stored in storage subsystem 3318. These software modules or instructions may be executed by processing unit 3304. Storage subsystem 3318 may also provide a repository for storing data used in accordance with the present invention.

Storage subsystem 3300 may also include a computer-readable storage media reader 3320 that can further be connected to computer-readable storage media 3322. Together and, optionally, in combination with system memory 3310, computer-readable storage media 3322 may comprehensively represent remote, local, fixed, and/or removable storage devices plus storage media for temporarily and/or more permanently containing, storing, transmitting, and retrieving computer-readable information.

Computer-readable storage media 3322 containing code, or portions of code, can also include any appropriate media known or used in the art, including storage media and communication media, such as but not limited to, volatile and non-volatile, removable and non-removable media implemented in any method or technology for storage and/or transmission of information. This can include tangible computer-readable storage media such as RAM, ROM, electronically erasable programmable ROM (EEPROM), flash memory or other memory technology, CD-ROM, digital versatile disk (DVD), or other optical storage, magnetic cassettes, magnetic tape, magnetic disk storage or other magnetic storage devices, or other tangible computer readable media. This can also include nontangible computer-readable media, such as data signals, data transmissions, or any other medium which can be used to transmit the desired information and which can be accessed by computing system 3300.

By way of example, computer-readable storage media 3322 may include a hard disk drive that reads from or writes to non-removable, nonvolatile magnetic media, a magnetic disk drive that reads from or writes to a removable, nonvolatile magnetic disk, and an optical disk drive that reads from or writes to a removable, nonvolatile optical disk such as a CD ROM, DVD, and Blu-Ray® disk, or other optical media. Computer-readable storage media 3322 may include, but is not limited to, Zip® drives, flash memory cards, universal serial bus (USB) flash drives, secure digital (SD) cards, DVD disks, digital video tape, and the like. Computer-readable storage media 3322 may also include, solid-state drives (SSD) based on non-volatile memory such as flash-memory based SSDs, enterprise flash drives, solid state ROM, and the like, SSDs based on volatile memory such as solid state RAM, dynamic RAM, static RAM, DRAM-based SSDs, magnetoresistive RAM (MRAM) SSDs, and hybrid SSDs that use a combination of DRAM and flash memory based SSDs. The disk drives and their associated computer-readable media may provide non-volatile storage of computer-readable instructions, data structures, program modules, and other data for computer system 3300.

Communications subsystem 3324 provides an interface to other computer systems and networks. Communications subsystem 3324 serves as an interface for receiving data from and transmitting data to other systems from computer system 3300. For example, communications subsystem 3324 may enable computer system 3300 to connect to one or more devices via the Internet. In some embodiments communications subsystem 3324 can include radio frequency (RF) transceiver components for accessing wireless voice and/or data networks (e.g., using cellular telephone technology, advanced data network technology, such as 3G, 4G or EDGE (enhanced data rates for global evolution), WiFi (IEEE 802.11 family standards, or other mobile communication technologies, or any combination thereof), global positioning system (GPS) receiver components, and/or other components. In some embodiments communications subsystem 3324 can provide wired network connectivity (e.g., Ethernet) in addition to or instead of a wireless interface.

In some embodiments, communications subsystem 3324 may also receive input communication in the form of structured and/or unstructured data feeds 3326, event streams 3328, event updates 3330, and the like on behalf of one or more users who may use computer system 3300.

By way of example, communications subsystem 3324 may be configured to receive data feeds 3326 in real-time from users of social networks and/or other communication services such as Twitter® feeds, Facebook® updates, web feeds such as Rich Site Summary (RSS) feeds, and/or real-time updates from one or more third party information sources.

Additionally, communications subsystem 3324 may also be configured to receive data in the form of continuous data streams, which may include event streams 3328 of real-time events and/or event updates 3330, that may be continuous or unbounded in nature with no explicit end. Examples of applications that generate continuous data may include, for example, sensor data applications, financial tickers, network performance measuring tools (e.g. network monitoring and traffic management applications), clickstream analysis tools, automobile traffic monitoring, and the like.

Communications subsystem 3324 may also be configured to output the structured and/or unstructured data feeds 3326, event streams 3328, event updates 3330, and the like to one or more databases that may be in communication with one or more streaming data source computers coupled to computer system 3300.

Computer system 3300 can be one of various types, including a handheld portable device (e.g., an iPhone® cellular phone, an iPad® computing tablet, a PDA), a wearable device (e.g., a Google Glass® head mounted display), a PC, a workstation, a mainframe, a kiosk, a server rack, or any other data processing system.

Due to the ever-changing nature of computers and networks, the description of computer system 3300 depicted in the figure is intended only as a specific example. Many other configurations having more or fewer components than the system depicted in the figure are possible. For example, customized hardware might also be used and/or particular elements might be implemented in hardware, firmware, software (including applets), or a combination. Further, connection to other computing devices, such as network input/output devices, may be employed. Based on the disclosure and teachings provided herein, a person of ordinary skill in the art will appreciate other ways and/or methods to implement the various embodiments.

In the foregoing description, for the purposes of explanation, numerous specific details were set forth in order to provide a thorough understanding of various embodiments of the present invention. It will be apparent, however, to one skilled in the art that embodiments of the present invention may be practiced without some of these specific details. In other instances, well-known structures and devices are shown in block diagram form.

The foregoing description provides exemplary embodiments only, and is not intended to limit the scope, applicability, or configuration of the disclosure. Rather, the foregoing description of the exemplary embodiments will provide those skilled in the art with an enabling description for implementing an exemplary embodiment. It should be understood that various changes may be made in the function and arrangement of elements without departing from the spirit and scope of the invention as set forth in the appended claims.

Specific details are given in the foregoing description to provide a thorough understanding of the embodiments. However, it will be understood by one of ordinary skill in the art that the embodiments may be practiced without these specific details. For example, circuits, systems, networks, processes, and other components may have been shown as components in block diagram form in order not to obscure the embodiments in unnecessary detail. In other instances, well-known circuits, processes, algorithms, structures, and techniques may have been shown without unnecessary detail in order to avoid obscuring the embodiments.

Also, it is noted that individual embodiments may have been described as a process which is depicted as a flowchart, a flow diagram, a data flow diagram, a structure diagram, or a block diagram. Although a flowchart may have described the operations as a sequential process, many of the operations can be performed in parallel or concurrently. In addition, the order of the operations may be re-arranged. A process is terminated when its operations are completed, but could have additional steps not included in a figure. A process may correspond to a method, a function, a procedure, a subroutine, a subprogram, etc. When a process corresponds to a function, its termination can correspond to a return of the function to the calling function or the main function.

The term “computer-readable medium” includes, but is not limited to portable or fixed storage devices, optical storage devices, wireless channels and various other mediums capable of storing, containing, or carrying instruction(s) and/or data. A code segment or machine-executable instructions may represent a procedure, a function, a subprogram, a program, a routine, a subroutine, a module, a software package, a class, or any combination of instructions, data structures, or program statements. A code segment may be coupled to another code segment or a hardware circuit by passing and/or receiving information, data, arguments, parameters, or memory contents. Information, arguments, parameters, data, etc., may be passed, forwarded, or transmitted via any suitable means including memory sharing, message passing, token passing, network transmission, etc.

Furthermore, embodiments may be implemented by hardware, software, firmware, middleware, microcode, hardware description languages, or any combination thereof. When implemented in software, firmware, middleware or microcode, the program code or code segments to perform the necessary tasks may be stored in a machine readable medium. A processor(s) may perform the necessary tasks.

In the foregoing specification, aspects of the invention are described with reference to specific embodiments thereof, but those skilled in the art will recognize that the invention is not limited thereto. Various features and aspects of the above-described invention may be used individually or jointly. Further, embodiments can be utilized in any number of environments and applications beyond those described herein without departing from the broader spirit and scope of the specification. The specification and drawings are, accordingly, to be regarded as illustrative rather than restrictive.

Additionally, for the purposes of illustration, methods were described in a particular order. It should be appreciated that in alternate embodiments, the methods may be performed in a different order than that described. It should also be appreciated that the methods described above may be performed by hardware components or may be embodied in sequences of machine-executable instructions, which may be used to cause a machine, such as a general-purpose or special-purpose processor or logic circuits programmed with the instructions to perform the methods. These machine-executable instructions may be stored on one or more machine readable mediums, such as CD-ROMs or other type of optical disks, floppy diskettes, ROMs, RAMs, EPROMs, EEPROMs, magnetic or optical cards, flash memory, or other types of machine-readable mediums suitable for storing electronic instructions. Alternatively, the methods may be performed by a combination of hardware and software. 

What is claimed is:
 1. A method of distributing microservice containers for a service across a plurality of computing environments, the method comprising: receiving a service that is built from a plurality of containerized microservices to be deployed in a container platform, wherein the container platform comprises a plurality of computing environments; receiving a deployment criteria for deploying the service in the container platform; accessing characteristics of the plurality of computing environments; and deploying the plurality of containerized micro services across the plurality of computing environments based on the deployment criteria and the characteristics of the plurality of computing environments.
 2. The method of claim 1, wherein the plurality of computing environments comprises a cloud computing environment.
 3. The method of claim 1, wherein the plurality of computing environments comprises an on-premise computing environment.
 4. The method of claim 1, wherein at least one of the plurality of containerized micro services comprises a plurality of containers in a single container pod.
 5. The method of claim 1, wherein at least one of the plurality of containerized micro services comprises a single container comprising a plurality of micro-services in a single container pod.
 6. The method of claim 1, wherein the deployment criteria comprises an indication that the service deployment should be optimized for security.
 7. The method of claim 1, wherein the deployment criteria comprises an indication that the service deployment should be optimized for availability.
 8. The method of claim 1, wherein the deployment criteria comprises an indication that the service deployment should be optimized for cost.
 9. The method of claim 1, wherein the deployment criteria comprises an indication that the service deployment should be optimized for latency.
 10. The method of claim 1, wherein the deployment criteria comprises an indication that the service deployment should be optimized for co-location of the containerized micro services.
 11. A non-transitory, computer-readable medium comprising instructions that, when executed by one or more processors, causes the one or more processors to perform operations comprising: receiving a service that is built from a plurality of containerized microservices to be deployed in a container platform, wherein the container platform comprises a plurality of computing environments; receiving a deployment criteria for deploying the service in the container platform; accessing characteristics of the plurality of computing environments; and deploying the plurality of containerized micro services across the plurality of computing environments based on the deployment criteria and the characteristics of the plurality of computing environments.
 12. The non-transitory, computer-readable medium of claim 11, wherein each of the plurality of containerized micro services comprises an attribute corresponding to the deployment criteria.
 13. The non-transitory, computer-readable medium of claim 12, wherein values for the micro service attributes are matched with the characteristics of the plurality of computing environments.
 14. The non-transitory, computer-readable medium of claim 11, wherein the plurality of containerized micro services are deployed by a container platform scheduler.
 15. A system comprising: one or more processors; and one or more memory devices comprising instructions that, when executed by the one or more processors, cause the one or more processors to perform operations comprising: receiving a service that is built from a plurality of containerized microservices to be deployed in a container platform, wherein the container platform comprises a plurality of computing environments; receiving a deployment criteria for deploying the service in the container platform; accessing characteristics of the plurality of computing environments; and deploying the plurality of containerized micro services across the plurality of computing environments based on the deployment criteria and the characteristics of the plurality of computing environments.
 16. The system of claim 15, wherein the plurality of containerized micro services are deployed by an API registry.
 17. The system of claim 16, wherein the API registry is deployed as a service encapsulated in a container in the container environment.
 18. The system of claim 16, wherein the API registry is available to: services in development in an Integrated Development Environment (IDE); and services already deployed in the container environment.
 19. The system of claim 16, wherein the API registry maps service endpoints for the plurality of container pods to one or more API functions.
 20. The system of claim 16, wherein the API registry automatically generates client libraries for calling services to call the service. 